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Polvketi des and their synthesis 

Field of Invention 

The present invention relates to processes and materials (including recombinant 
strains) for the preparation and isolation of macrolide compounds, particularly compounds 
differing from natural compounds at least in terms of glycosylation. It is particularly 
concerned with erythromycin and azithromycin analogues wherein the natural sugar at the 5- 
position has been replaced. The invention includes the use of recombinant cells in which 
gene cassettes are expressed to generate novel macrolide antibiotics. 



Background to the Invention 

The biosynthetic pathways to the macrolide antibiotics produced by actinomycete 
bacteria generally involve the assembly of an aglycone structure, followed by specific 
modifications which may include any or all of: hydroxylation or other oxidative steps, 
methylation and glycosylation. In the case of the 14-membered macrolide erythromycin A 
these modifications consist of the specific hydroxylation of 6-deoxyerythronolide B to 
erythronolide B which is catalysed by EryF, followed .by the sequential attachment of 
mycarose via the hydroxyl group at C-3 catalysed by the mycarosyltransferase EryBV 
(Staunton and Wilkinson, 1997). The attachment of desosamine via the hydroxyl group at C- 
5, catalysed by EryCm, then results in the production of erythromycin D, the first 
intermediate with antibiotic activity. Erythromycin D is subsequently converted to 
erythromycin A by hydroxylation at C-12 (EryK) and O-methylation (EryG) on the 
mycarosyl group, this order being preferred (Staunton and Wilkinson, 1997). The 
biosynthesis of dTDP-L-mycarose and dTDP-D-desosamine has been studied in detail 
25 (Gaisser et al, 1997; Summers et al, 1997; Gaisser et al, 1998; Salah-Bey et al, 1998). 

Recently 3.1 A high-resolution X-ray investigation of the interaction of ribosomes 
with macrolides (SchlUnzen et al, 2001, Hansen et al, 2002) has revealed key interactions 
giving direct insights into ways in which macrolide templates might be adapted, by chemical 
or biological approaches, for increased ribosomal binding and inhibition and for improved 
effectiveness against resistant organisms. In particular, previous indications about the 
importance of the sugar substituent at the C-5 hydroxyl of the macrocycle for ribosomal 
binding are fully borne out by the structural analysis; this substituent extends towards the 
peptidyl transferase centre and in the case of 16 membered macrolides, which bear a 
disaccharide at C-5, reaches further into the peptidyl transferase centre, thus providing a 
molecular basis for the observation that 16 membered macrolides inhibit ribosomai'capacity 
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to form even a single peptide bond (Poulsen et al, 2000). This suggests that erythromycins 
with alternative substituents at the C-5 positions, for example mycaminosyl and 
angolosaminosyl erythromycins, and in particular mycaminosyl and 4<-0 substituted 
mycaminosyl erythromycins, are highly desirable as potential anti-bacterial agents. 
5 Since post-polyketide synthase modifications are often critical for biological activity 

(Liu and Thorson, 1994; Kaneko et al., 2000), there has been increasing interest in 
understanding the mechanism and specificity of the enzymes involved to engineer the 
biosynthesis of diverse novel hybrid macrolides with potentially improved activities. Recent 
work has demonstrated that the manipulation of sugar biosynthetic genes is a powerful 
1 0 approach to isolate novel macrolide antibiotics. The recently demonstrated relaxed specxficrty 
of the glycosyltransferases is crucial for this approach (see Mendez and Salas, 2001 and 
references therein). In the pathways to erythromycin A and methymycin / neomethymycm, 
the production of hybrid macrolides has been observed after inactivation of specific genes 
involved in the biosynthesis of deoxyhexoses (Gaisser * «/., 1997; Summers et al, 1997; 
Gaisser**/., 1998; Salah-Bey eta!., 1998; Zhao et al, 1998a; Zhao et al., 1998b)or after 
the expression of genes from different biosynthetic gene clusters (Zhao et al., 1999). A 
relaxed specificity towards the sugar substrate has also been reported for glycosyltransferases 
that have been expressed in heterologous strains, including glycosyltransferases from the 
pathways to vancomycin (Solenberg et al, 1997), elloramycin (Wohlert et al, 1998), 
oleandomycin (Doumith et al, 1999; Gaisser et al., 2000), pikromycin (Tang and McDamel, 
2001) epirubicin (Madduri et al, 1998), avermectin (Wohlert et al, 2001) and spinosyn 
(Gaisser et al, 2002a). Most of the successful alterations so far reported have mvolved 
relaxed specificity towards the activated sugar moiety, while as yet only isolated examples are 
known where a glycosyltransferase targets its deoxysugar to an alternative aglycone substrate 
(Spagnoli 1983; Trefzer etal, 1999). Both WO 97/23630 and WO 99/05283 descnbe 
the production of erythromycins with an altered glycosylate pattern in culture supernatants 
by deletion of a specific sugar biosynthesis gene. Thus WO 99/05283 describes low but 
detectable levels of S-O-dedesosaminyl-S-O-mycaminosyl erythromycin D in the culture 
supernatant of an eryCW knockout strain ot S erythraea. Ir aiso has been demonstrated that 
the use of the gene cassette technology described in patent WO01/79520 is a powerful and 
potentially general approach to isolate novel macrolide antibiotics by expressmg 
oombinationsof genes in mutant strains of S. (Gaisser „ a/., 2002b). WO 01/79520 

also describes the detection of S-O-dedesosaminyl-S-O-mycaminosyl erythromycin A in 
culture supernatants of the S. erythraea strains SGQ2 P SGCin and SGQ2p(mycaminose)Cin, 
35 fed with 3-O-mycarosylerymronolide B. However, the low levels of 5-0-dedesosarmnyl-5-0 
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mycaminosyl erythromycin A make this a less than optimal method for producing this 
valuable material on large scales and similar problems were encountered synthesizing 5-0- 
dedesosaminyl-5-O-mycaminosyl erythromycin A using chemical methods (Jones et al. 9 
1969). EP 1024145 refers to the isolation of azithromycin analogues carrying a mycaminosyl 
residue such as 5-O-dedesosaminyl-5-0-mycaminosyl azithromycin and 3"-desmethyl-5-0- 
dedesosaminyl-5-O-mycaminosyl azithromycin. However the only examples given in this 
area are "prophetic examples" and there is no evidence that they could actually be put into 
practice. 

Therefore, the present invention provides the first demonstration of an efficient and 
highly effective method for making significant quantities of erythromycins and azithromycins 
which have non-natural sugars at the C-5 position, in particular mycaminose and 
angolosamine. In a specific aspect the present invention provides for the synthesis of 
mycaminose and angolosamine using specific combinations of sugar biosynthetic genes in 
gene cassettes. 

Summary of the Invention 

The present invention relates to processes, and recombinant strains, for the 
preparation and isolation of erythromycins and azithromycins, which differ from the 
corresponding naturally occurring compound in the glycosylation of the C-5 position. In 
particular, the present invention relates to processes and recombinant strains for the 
preparation and isolation of 5-0-dedesosaminyl-5-0-mycaminosyl, or angolosaminyl 
erythromycins and azithromycins, in particular 5-0-dedesosaminyl-5-0-mycaminosyl 
erythromycins and 5-0-dedesosaminyl-5-0-mycaminosyl azithromycins, and specifically 5- 
O-dedesosaminyl-5-O-mycaminosyl erythromycin B, 5-0-dedesosaminyl-5-0-mycaminosyl 
erythromycin C, 5-0-dedesosaminyl-5-0-mycaminosyl erythromycin D, 5-0-dedesosaminyl- 
5 - 0-my caminosy 1 erythromycin A, and 5-O-dedesosaminyl-5-0-mycaminosyl azithromycin. 
The present invention further relates to novel 5-0-dedesosaminyl-5-0-mycaminosyl, 
angolosaminyl erythromycins and azithromycins produced thereby. 

Detailed description of the Invention 

The present invention relates to processes, and recombinant strains, for the 
preparation and isolation of erythromycins and azithromycins which differ from the naturally 
occurring compound in the glycosylation of the C-5 position. These are referred to herein as 
"compounds of the invention" and unless the context dictates otherwise, such a reference 
includes a reference to 5-0-dedesosaminyl-5-O-mycaminosyl erythromycins, 5-0- 



dedesosaminyl-5-O-angolosaminyl erythromycins, 5-0-dedesosaminyI-5-0-mycaminosyl 
azithromycins, and 5-O-dedesosaminyl-5-0-angolosaminyl azithromycins, specifically 5-0- 
dedesosaminyl-5-O-mycaminosyl erythromycin A, 5-0-dedesosaminyl-5-(9-mycaminosyl 
erythromycin C, 5-0-dedesosaminyl-5-0-mycaminosyl erythromycin B, 5-0-dedesosaminyl- 
5-0-mycaminosyl erythromycin D, 5-O-dedesosaminyl-S-O-mycaminosyl azithromycin, 5-0- 
dedesosaminyl-5-O-angolosaminyl erythromycin A, 5-0-dedesosaminyl-5-0-angolosaminyl 
erythromycin B, S-O-dedesosaminyl-5-O-angolosaminyl erythromycin C, 5-0- 
dedesosaminyl-5-O-angoIosaminyl erythromycin D, 5-0-dedesosaminyl-5-0-angolosaminyl 
azithromycin and analogues thereof which additionally vary in glycosylation at the C3 
position (see WO 01/79520) and which may also vary in the aglycone backbones (see WO 
98/01571, EP 1024145, WO 93/13663, WO 98/49315). The invention relates to processes, 
and recombinant strains, for the preparation and isolation of compounds of the invention. The 
present invention further relates to novel 5-0-dedesosaminyl-5-0-angolosaminyl 
erythromycins and azithromycins produced thereby (Figure 1). The methodology comprises 
in part the expression of a gene cassette in the S. erythraea mutant strain SGQ2 (which carries 
genomic deletions in eryA, eryCIII, eryBVand eryCW (WOO 1/79520)), as described in 
Example 3 and 6 and in S. erythraea Q42/1 (BIOT-2166) (Examples 1- 4) and S. erythraea 
18A1 (BIOT-2634) (Example 6). Detailed descriptions are given in Examples 1-11. 

The invention relates to a process involving the transformation of an actinomycete 
strain, including but not limited to strains of S. erythraea such as SGQ2, (see WO 01/79520) 
or Q42/1 or 18A1 (whose preparation is described below) with an expression plasmid 
containing a combination of genes which are able to direct the biosynthesis of a sugar moiety 
and direct its subsequent transfer to an aglycone or pseudoaglycone. 

In a particular embodiment the present invention relates to a gene cassette containing 
a combination of genes which are able to direct the synthesis of mycaminose in an appropriate 
strain background. The gene cassette may include genes selected from but not limited to 
angorfU, tylMEI, tylM, tylB, tylAI, tylAII, tylla, angAI, angAII, angMUI, angB, angAH, 
eryG, eryK and glycosyltransferase genes including but not limited to tylMII, angMH, desVII, 
eryCIII, eryBV, spnP, and midl. In a preferred embodiment the gene cassette comprises 
angorfl4 in combination with one or more other genes which are able to direct the synthesis 
of mycaminose. In an more preferred embodiment the gene cassette comprises angAI, 
angAII angorfl4, angMUI, angB, angAII, in combination with one or more 
glycosyltransferases such as but not limited to eryCIII, tylMII, angAHI, In an alternative 
embodiment the gene cassette comprises tylAI, tylAII, tylMUI, tylB, tylla, tylMI m 



combination with glycosyltransferases such as but not limited to eryCIII, tylMII and angMII. 
In a preferred embodiment the strain is an S. erythraea strain. 

In a particular embodiment the present invention relates to a gene cassette containing 
combinations of genes which are able to direct the synthesis of angolosamine, including but 
not limited to angMII, angMI, angB, angAI, angAII, angorfl4 t angorf4, tylMUI, tylMI, tylB, 
tylAI, tylAII 9 eryCVI, spnO, eryBVI, and eryK and one or more glycosyltransferase genes 
including but not limited to eryCIII, tylMII, angMII desVII, eryBV, spnP and midl. In a 
preferred embodiment the gene cassette contains angMII angMI, angB, angAI, angAII 
angorfl4, spnO in combination with a glycosyltransferase gene such as but not limited to 
angMII tylMII or eryCIII. In a preferred embodiment the strain is an S. erythraea strain. 

In one embodiment, the process of the present invention further involves feeding of 
an aglycone and/or a pseudoaglycone substrate (for definition see below), including (but not 
limited to) 3-O-mycarosyl eiythronolide B, erythronolide B, 6-deoxy erythronolide B, 3-0- 
mycarosyl-6-deoxy erythronolide B, tylactone, spinosyn pseudoaglycone, 3-0-rhamnosyl 
erythronolide B, 3-0-rhamnosyl-6-deoxy erythronolide B to cultures of the transformed 
actinomycete strains, the bioconversion of the substrate to compounds of the invention and 
optionally the isolation of said compounds. This process is exemplified in Examples 1-11. 
However, a person of skill in the art will appreciate that in an alternative embodiment the host 
cell can express the desired aglycone template, either naturally or recombinantly. 

As used herein, the term "pseudoaglycone" refers to a partially glycosylated 
intermediate of a multiply-glycosylated product. 

Those skilled in the art will appreciate that alternative host strains can be used. A 
preferred cell is a prokaryote or a fungal cell or a mammalian cell. A particularly preferred 
host cell is a prokaryote, more preferably host cell strains such as actinomycetes, 
Pseudomonas, myxobacteria, and E. coli. It will be appreciated that if the host cell does not 
naturally produce erythromycin, or a closely related 14-membered macrolide, it may be 
necessary to introduce a gene conferring self-resistance to the macrolide product, such as 
ermE from S. erythraea. Even more preferably the host cell is an actinomycete, even more 
preferably strains that include but are not limited to S. erythraea, Streptomyces griseofuscus, 
Streptomyces cinnamonensis, Streptomyces albus, Streptomyces lividans, Streptomyces 
hygroscopicus sp. y Streptomyces hygroscopicus var. ascomyceticus 9 Streptomyces 
longisporoflavus, Saccharopolyspora spinosa, Streptomyces tsukubaensis, Streptomyces 
coelicolor, Streptomyces fradiae, Streptomyces rimosus, Streptomyces avermitilis, 
Streptomyces eurythermus, Streptomyces venezuelae, Amycolatopsis mediterranei. In a more 
highly preferred embodiment the host cell is S. erythraea. 



It will readily occur to those skilled in the art that the substrate fed to the recombinant 
cultures of the invention need not be a natural intermediate in erythromycin biosynthesis. 
Thus, the substrate could be modified in the aglycone backbone (see Examples 8-1 1) or in the 
sugar attached at the 3-position or both. WO 01/79520 demonstrates that the desosaminyl 
5 transferase EryCIII exhibits relaxed specificity with respect to the pseudoaglycone substrate, 
converting 3-0-rhamnosyl erythronolides into the corresponding 3-0-rhamnosyl 
erythromycins. Appropriate modified substrates may also be produced by chemical semi- 
synthetic methods. Alternatively, methods of engineering the eiythromycin-producing 
polyketide synthase, DEBS, to produce modified erythromycins are well known in the art (for 

10 example WO 93/13663, WO 98/01571, WO 98/01546, WO 98/493 15, Kato, Y. et al, 2002 ). 
Likewise, WO 01/79520 describes methods for obtaining erythronolides with alternative 
sugars attached at the 3-position. Therefore, the term "compounds of the invention" includes 
all such non-natural aglycone compounds as described previous additionally with alternative 
sugars at the C-5 position. All these documents are incorporated herein by reference. 

15 It will readily occur to those skilled in the art that the compounds of the invention 

containing a mycaminosyl moiety at the C-5 position could be modified at the C4 hydroxyl 
group of the mycaminosyl moiety, including but not limited to glycosylation (see also WO 
01/79520), acylation or chemical modification. 

The present invention thus provides variants of erythromycin and related macrolides 

20 having at the 5-position a non-naturally occurring sugar, in particular an O-mycaminosyl, or 
angolosaminyl residue or a derivative or precursor thereof, specifically an 0- angolosaminyl 
residue or a derivative thereof. 

The term "variants of erythromycin" encompasses (a) erythromycins A, B, C and D; 
(b) semi-synthetic derivatives such as azithromycin and other derivatives as discussed in EP 

25 1024145, which is incorporated herein by reference; (c) variants produced by genetic 

engineering and semi-synthetic derivatives thereof. Variants produced by genetic engineering 
include variants as taught in, or producible by, methods taught in WO 98/01571, EP 1024145, 
WO 93/13663, WO 98/49315 and WO 01/79520 which are incorporated herein by reference. 
The compounds of the invention include variants of erythromycin where the natural sugar at 

30 position C5 has been replaced with mycaminose or angolosamine and also includes 

compounds of the following formula (1) and pharmaceutically acceptable salts thereof. No 
stereochemistry is shown in Formula 1 as all possibilities are covered, including "natural" 
stereochemistries (as shown elsewhere in this specification) at some or all positions. 

35 Formula I: 
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R 2 
R 1 




OR 13 



R 6 
OR ls 



R 2 ' 
R 1 




OR 13 



R 6 

OR 15 



R ,= H, CH 3 , C2H 5 or selected from i) see below 

R 2 , R 4 , R 5 , R 6 , R 7 and R 9 are each independently H, OH, CH 3 , or OCH 3 
R 3 = H or OH 
R 8 = Hor 

OR 10 



R 16 = HorOH 

P. 14 = H or -C(0)NR c R d wherein each of R c and R d is independently H, C,-C 10 aikyl, C 2 -C 20 
alkenyl, C 2 -C, 0 alkynyl, -(CH 2 ) m (C 6 -C, 0 aryl), or-(CH 2 ) m (5-10 membered heteroaryl), 
wherein ra is an integer ranging from 0 to 4, and wherein each of the foregoing R c and R d 
groups, except H, may be substituted by 1 to 3 Q groups; or wherein R c and R d may be taken 
together to form a 4-7 membered saturated ring or a 5- 1 0 membered heteroaryl ring, wherein 
said saturated and heteroaryl rings may include 1 or 2 heteroatoms selected from O, S and N, 
in addition to the nitrogen to which R c and R d are attached, and said saturated ring may 




or selected from iv) see below 



R 10 =HorCH 3 oracyl 
R n = H or I 




R 12 = H oracyl 
R 13 = HorCH, 
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include 1 or 2 carbon-carbon double or triple bonds, and said saturated and heteroaryl rings 
may be substituted by 1 to 3 Q groups; or R 2 and R 17 taken together form a carbonate ring; 
each Q is independently selected from halo, cyano, nitro, trifluoromethyl, azido, -C(0)Q l , - 
OC(0)Q\ -C(0)OQ', -OC(0)OQ l , -NQ 2 C(0)Q 3 , -C(0)NQ 2 Q 3 , -NQ 2 Q 3 , hydroxy, d-C 6 
alkyl, C,-C 6 alkoxy, -(CH 2 ) m (C 6 -C 10 aryl), and -CCH 2 ) m (5-10 membered heteroaryl), wherein 
m is an integer ranging from 0 to 4, and wherein said aryl and heteroaryl substituents may be 
substituted by 1 or 2 substituents independently selected from halo, cyano, nitro, 
trifluoromethyl, azido, -CCOQ 1 , -OCOOQ 1 , -OC(0)OQ ! , -NQ 2 C(0)Q 3 , -C(0)NQ 2 Q 3 , - 
NQ 2 Q 3 , hydroxy, C,-C 6 alkyl, and C,-C 6 alkoxy; 

each Q\ Q 2 and Q 3 is independently selected from H, OH, C,-C 10 alkyl, C,-C 6 alkoxy, 
C 2 -C,o alkenyl, C 2 -C I0 alkynyl, -(CH 2 )m(C6-C 10 aryl), and -(CH 2 ) m (5-10 membered 
heteroaryl), wherein m is an integer ranging from 0 to 4; with the proviso that the compound 
is not 5-0-dedesosaminyl-5-0-mycaminosyl erythromycin A or D. 

The present invention also provides compounds according to formula I above in which: 
i) the substituent R 1 is selected from 

- an alpha-branched C 3 -C 8 group selected from alkyl. alkenyl, alkynyl, 
alkoxyalkyl and alkylthioalkyl groups any of which may be optionally 
substituted by one or more hydroxyl groups; 

- a C 5 -C 8 cycloalkylalkyl group wherein the alkyl group is an alpha-branched 

20 C 2 -C 5 alkyl group 

- a C 3 -C 8 cycloalkyl group or C 5 -C 8 cycloalkenyl group, either of which may 
optionally be substituted by one or more hydroxyl, or one or more C1-C4 
alkyl groups or halo atoms 

- a 3 to 6 membered oxygen or sulphur containing heterocyclic ring which may 
25 be saturated, or fully or partially unsaturated and which may optionally be 

substituted by one or more C,-C 4 alkyl groups, halo atoms or hydroxyl groups. 

- phenyl which may be optionally substituted with at least one substituent 
selected from C,-C 4 alkyl, C r C 4 alkoxy and C,-C 4 alkylthio groups, halogen 
atoms, Uuiuoioiuetliyl; as3=cycno-or 

30 . r» is R 17 -CH 2 - where R 17 is H, C,-C 8 alkyl, C 2 -C 8 alkenyl, C 2 -C 8 alkynyl, 

alkoxyalkyl or alkylthioalkyl containing from 1 to 6 carbon atoms in each 
alkyl or alkoxy group wherein any of said alkyl, alkoxy, alkenyl or alkynyl 
groups may be substituted by one or more hydroxyl groups or by one or more 
halo atoms; or a C 3 -C 8 cycloalkyl or C 5 -C 8 cycloalkenyl either of which may 

35 be optionally substituted by one or more C,-C 4 alkyl groups or halo atoms; or 
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a 3 to 6 membered oxygen or sulphur containing heterocyclic ring which may 
be saturated or folly or partially unsaturated and which may optionally be 
substituted by one or more C1-C4 alkyl groups or halo atoms; or a group of 
the formula SAi 6 wherein A 16 is C,-C 8 alkyl, C 2 -C 8 alkenyl, C 2 -C 8 alkynyl, 

5 C 3 -C 8 cycloalkyl, C 5 -C 8 cycloalkenyl, phenyl or substituted phenyl wherein 

the substituent is C1-C4 alkyl, C1-C4 alkoxy or halo, or a 3 to 6 membered 
oxygen or sulphur-containing heterocyclic ring which may be saturated, or 
fully or partially unsaturated and which may optionally be substituted by one 
or more C1-C4 alkyl groups or halo atoms 

10 ii) the substituents R 2 , R 4 , R s , R 6 , R 7 and R 9 are each, independently, H, OH, CH 3 , 

CjHs, OCH3 

iii) the -CHOH- at CI 1 (erythromycins) or C12 (azithromycins) is replaced by a 
methylene group (-CH2-), a keto group (C=0), or by a 10,1 1-olefinic bond 
(erythromycins) or 11,12-olefinic bond (azithromycins) 
1 5 iv) R 8 includes but is not limited to rhamnose, 2'-0-methyl rhamnose, 2',3'-bis-0- 

methyl rhamnose, 2\3',4 , -tri-0-methyl rhamnose, oleandrose, oliose, digitoxose 
or olivose 

v) the substituent R n is H or mycarose or C4-0-acyl-mycarose or glucose 
The present invention also provides compounds according to formula I above which 
20 differ in the oxidation state of one or more of the ketide units (i.e. selection of alternatives 

from the group: -CO-, -CH(OH)-, alkene -CH-, and CH 2 ) where the stereochemistry of any - 
CH(OH)- is also independently selectable. 

Novel 5-0-dedesosaminyl-5-0-angolosaminyl erythromycins and azithromycins 
made available by this aspect of the invention include, but are not limited to those where in 
25 the R 15 group R U = R ,6 =H, with the proviso that they are not angolamycin or medermycin 
(Kinumaki and Suzuki, 1972; Ichinose et al., 2003). 

Additionally, a person of skill in the art will appreciate that using the methods of the 
present invention mycaminose and angolosamine may be added to other aglycones or 
pseudoaglycone for example (but without limitation) tylactone or spinosyn pseudoaglycone. 
30 These other aglycones or pseudoaglycones may be the naturally occurring structure or they 
may be modified in the aglycone backbone, such modified substrates may be produced by 
chemical semi-synthetic methods (Kaneko et al., 2000 and references cited therein), or, 
alternatively, via PKS engineering, such methods are well known in the art (for example WO 
93/13663, WO 98/01571, WO 98/01546, WO 98/49315, Kato, Y. etal., 2002) ). 
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Moreover, the process of the host cell selection further comprises the optional step of 
deleting or inactivating or adding or manipulating genes in the host cell. This process 
comprises the improvement of recombinant host strains for the preparation and isolation of 
compounds of the invention, in particular 5-0-dedesosaminyl-5-0-mycaminosyl 
5 erythromycins and 5-O-dedesosaminyl-5-0-mycaminosyl azithromycins, specifically 5-0- 
dedesosaminyl-5-0-mycaminosyl erythromycin A, 5-0-dedesosaminyl-5-0-mycaminosyl 
erythromycin C, 5-0-dedesosaminyl-5-0-mycaminosyl erythromycin B, 5-0-dedesosaminyl- 
5-0-mycaminosyl erythromycin D and 5-0-dedesosaminyl-5-0-mycaminosyl azithromycin. 
This approach is exemplified in Example 1 by introducing an etyBVI mutation into the 

10 chromosome of S. erythraea SGQ2 in order to optimise the conversion of the substrate 3-0- 
mycarosyl erythronoiide B to 5-0-dedesosaminyl-5-0-mycaminosyl erythromycins. 

In a further aspect the invention relates to the construction of gene cassettes. The 
cloning method used to isolate these gene cassettes is analogous to that used in 
PCT/GB03/003230 and diverges significantly from the approach previously described (WO 

15 01/79520) by assembling the gene cassette directly in an expression vector rather than pre- 
assembling the genes in pUC18/19 plasmids, thus providing a more rapid cloning procedure 
for the isolation of gene cassettes. The strategy for isolating these gene cassettes is 
exemplified in Example 1 to Example 1 1 . A schematic overview of the strategy is given in 
Figure 2. 

20 Another aspect of the invention allows the enhancement of gene expression by 

changing the order of genes in a gene cassette, the genes including but not limited to tylMI, 
tylMUI, tylB, eryCVI, tylAI, tylAII, eryCIII, eryBV t cmgAI, angAII, angMUI, angB, angMT t 
angorfl4, angor/4, eryBVI, eryK, eryG, angMII, tylMII, desVII„midI, spnO, spnN, spnP and 
genes with similar functions, allowing the arrangement of the genes in a multitude of 

25 permutations (Figure 2). 

The cloning strategy outlined in this invention also allows the introduction of a 
histidine tag in combination with a terminator sequence 3 * of the gene cassette to enhance 
gene expression (see Example 1). Those skilled in the art will appreciate other terminator 
sequences well known in the art could be used. See, for example Bussiere and Bastia (1999), 

30 Bertram et al (2001) and Kieser et al (2000), incorporated herein by reference. 

Another aspect of the invention comprises the use of alternative promoters such as 
ptipA (Ali et al, 2002) and/or ptr (Salah-Bey et al, 1995) to express genes and/or assembled 
gene cassette(s) to enhance expression. 

Another aspect of the invention describes the multiple uses of promoter sequences in 

35 the assembled gene cassette to enhance gene expression as exemplified in Example 6. 
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Another aspect of the invention describes the addition of genes encoding for aNDP- 
glucose-synthase such as tyLUaad aNDP-glucose-4,6-dehydratase such as tylAII to the gene 
cassette in order to enhance the endogenous production of the activated sugar substrate. Those 
skilled in the art will appreciate that alternative sources of equivalent sugar biosynthetic 
pathway genes may be used. In this context alternative sources include but are not limited to: 
TvlAI- homologues : Desm of Streptomyces venezuelae (accession no AAC68682), 
GrsD of Streptomyces griseus (accession no AAD31799), AveBffl of Streptomyces 
avermitilis (accession no BAA84594), Gtt of Saccharopolyspora spinosa (accession 
no AAK83289), SnogJ of Streptomyces nogalater (accession no AAF01820), AclY of 
Streptomyces galilaeus (accession no BAB72036), LanG of Streptomyces cyanogenus 
(accession no AAD13545), Graorfl6(GraD) of Streptomyces violaceoruber 
(accession no AAA99940), OleS of Streptomyces antibioticus (accession no 
AAD5S453) and StrD of Streptomyces griseus (accession no A26984) and AngAI of 
S. eurythermus, 

TvlAII- homologues : AprE of Streptomyces tenebrarius (accession no AAG18457), 
GdH of £ spinosa (accession no AAK83290), DesIV of £ venezuelae (accession no 
AAC68681), GdH of £ erythraea (accession no AAA6821 1), AveBII of £ 
avermitilis (accession no BAA84593), ScfBl .08C of Streptomyces coelicolor 
(accession no CAB61555), LanH of & cyanogenus (accession no AAD13546), 
Graorfl7 (GraE) of £ violaceoruber (accession no S58686), OleE of £ antibioticus 
(accession no AAD55454), StrE of S. griseus (accession no P29782) and AngAII of 
£ eurythermus. 

Similarly, alternative sources for activated sugar biosynthesis gene homologues to 
tylMni angAIII, eryCII, tylMU, angMU, tylB, angB, eryCI, tylM 9 angMI, eryCVI, tylla, 
angorfl4, angorf4 t spnO, eryBVI, eryBV, eryCIU, desVII, rnidl spnN andspnP will readily 
occur to those skilled in the art, and can be used. 

Another aspect of the invention describes the use of alternative glycosyltransferases 
in the gene cassettes such as EryCHI. Those skilled in the art will appreciate that alternative 
glycosyltransferases may be used. In this context alternative glycosyltransferases include but 
are not limited to: TylMH (Accession no CAA57472), DesVH (Accession noAAC68677), 
MegCffl (Accession no AAG13921), MegDI (Accession no AAG13908) or AngMU of S. 
eurythermus. 

In one aspect of the present invention, the gene cassette may additionally comprise a 
chimeric glycosyltransferase (GT). This is particularly of benefit where the natural GT does 
not recognise the combination of sugar and aglycone that is required for the synthesis of the 
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desired analogue. Therefore, in this aspect the present invention specifically contemplates the 
use of a chimearic GT wherein part of the GT is specific for the recognition of the sugar 
whose synthesis is directed by the genes in said expression cassette when expressed in an 
appropriate strain background and part of the GT is specific for the aglycone or 
5 pseudoaglycone template (Hu and Walker, 2002). 

Those skilled in the art will appreciate that different strategies may be used for the 
introduction of gene cassettes into the host strain, such as site-specific integration vectors 
(Smovkina et aL, 1990; Lee et aL, 1991; Matsuura et aL, 1996; Van Mellaert et aL, 1998; 
Kieser et aL, 2000). Alternatively, plasmids containing the gene cassettes may be integrated 

10 into any neutral site on the chromosome using homologous recombination sites. Further, for 
a number of actinomycete host strains, including S. erythraea, the gene cassettes may be 
introduced on self-replicating plasmids (Kieser et aL, 2000; WO 98/01571). 

A further aspect of the invention provides a process for the production of compounds 
of the invention and optionally for the isolation of said compounds. 

15 A further aspect of the invention is the use of different fermentation methods to 

optimise the production of the compounds of the invention as exemplified in Example 1 . 
Another aspect of the invention is the addition of ery genes such as eryK and/or eryG into the 
gene cassette. One skilled in the art will appreciate that the process can be optimised for the 
production of a specific erythromycin (i.e. A, B, C, D) or azithromycin by manipulation of the 

20 genes eryG (responsible for the methylation on the mycarose sugar) and/or eryK (responsible 
for hydroxylation at CI 2). Thus, to optimise the production of the A-form, an extra copy of 
eryK may be included into the gene cassette. Conversely, if the erythromycin B analogue is 
required, this can be achieved by deletion of the eryK gene from the 51 erythraea host strain, 
or by working in a heterologous host in which the gene and/or its functional homologue, is 

25 not present Similarly, if the erythromycin D analogue is required, this can be achieved by 
deletion of both eryG and eryK genes from the S. erythraea host strain, or by working in a 
heterologous host in which both genes and/or their functional homologues are not present. 
Similarly, if the erythromycin C analogue is required, this can be achieved by deletion of the 
eryG gene from the S. erythraea host strain, or by working in a heterologous host in which the 

30 gene and/or its functional homologues are not present. 

In this context a preferred host cell strain is a mammalian cell strain, fungal ceils 
strain or a prokaryote. More preferably the host cell strain is Pseudomonas, mxyobacteria or 
E. coli. In a more preferred embodiment the host cell strain is an actinomycete, still more 
preferably including, but not limited to Saccharopolyspora erythraea, Streptomyces 

35 coelicolor, Streptomyces avermitilis, Streptomyces griseofuscus, Streptomyces 
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cinnamonensis, Streptomyces fradiae, Streptomyces eurythermus, Streptomyces 
longisporqflavus, Streptomyces hygroscopicus, Saccharopolyspora spinosa, Micromonospora 
griseorubida, Streptomyces lasaliensis, Streptomyces venezuelae, Streptomyces antibioticus, 
Streptomyces lividans, Streptomyces rimosus, Streptomyces albus, Amycolatopsis 
mediterranei, Nocardia sp, Streptomyces tsukubaensis and Actinoplcmes sp. N902-109. In a 
still more preferred embodiment the host cell strain is selected from Saccharopolyspora 
erythraea, Streptomyces griseofuscus, Streptomyces cinnamonensis, Streptomyces albus, 
Streptomyces lividans, Streptomyces hygroscopicus sp., Streptomyces hygroscopicus var. 
ascomyceticus, Streptomyces longisporqflavus, Saccharopolyspora spinosa, Streptomyces 
tsukubaensis, Streptomyces coelicolor, Streptomyces fradiae, Streptomyces rimosus, 
Streptomyces avermitilis, Streptomyces eurythermus, Streptomyces venezuelae, Amycolatopsis 
mediterranel In the most highly preferred embodiment the host strain is Saccharopolyspora 
erythraea. 

The present invention provides methods for the production and isolation of 
compounds of the invention, in particular of erythromycin and azithromycin analogues which 
differ from the natural compound in the glycosylation of the C-5 position, for example but 
without limitation: novel 5-0-dedesosaminyl-5-O-mycaminosyl or angolosaminyl 
erythromycins and 5-0-dedesosaminyl-5-0-mycaminosyl, or angolosaminyl azithromycins 
which are useful as anti-microbial agents for use in human or animal health. 

In further aspects the present invention provides novel products as obtainable by any 
of the processes disclosed herein. 

Brief description of Figures 

Figure J A: Structures of 5-0-dedesosaminyl-5-0-mycaminosyl erythromycin A, 5-0- 
dedesosaminyl-5-O-mycaminosyl erythromycin B and 5-0-dedesosaminyl-5-0-mycaminosyl 
erythromycin C. 

Figure IB: Structure of 5-0-dedesosaminyl-5-0-mycaminosyl azithromycin. 

Figure 2: Schematic overview over the gene cassette cloning strategy. Vector pSG144 

was derived from vector pSG142 (Gaisser et aL, 2000). Abbreviations: darn : DNA isolated 
from dam' strain background, XbaF 1 *: Xbal site sensitive to Dam methylation, eryRHS: DNA 
fragment of the right hand side of the ery-cluster as described previously (Gaisser et aL, 
2000). 
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Figure 3: Amino acid comparison between the published sequence of TylAl (below) and the 
amino acid sequence detected from the sequencing data described in this invention (above). 
The changes in the amino acid sequence are underlined. 

5 Figure 4: Amino acid comparison between the published sequence of TylAII (below) and the 
amino acid sequence detected from the sequencing data described in this invention (above). 
The changes in the amino acid sequence are underlined. 
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Figure 5: Structure of 5-0-angolosaminyl tylactone. 

Figure 6: Shows an overview of the angolamycin polyketide synthase gene cluster. 

Figure 7: The DNA sequence which comprises orfl4 and or/15 (angB) from the 

angolamycin gene cluster. 

Figure 8: The DNA sequence which comprises or/2 (angAI), or/3 (angAII) and orf4 

from the angolamycin gene cluster. 

Figure 9: The DNA sequence which comprises or/1 * (angMIII), or/2* (angMII) t an& 

20 or/3* (angMI) from the angolamycin gene cluster. 

Figure 10: The amino acid sequence which corresponds to or/2 (angAI). 

Figure 1 1: The amino acid sequence which corresponds to or/3 (angAII). 

25 

Figure 12: The amino acid sequence which corresponds to or/4. 

Figure 13: The amino acid sequence which corresponds to or/14. 

30 Figure 14: The amino acid sequence which corresponds to orfl5 (angB). 

Figure 15: The amino acid sequence which corresponds to or/1* (angMIII). 

Figure 16: The amino acid sequence which corresponds to or/2* (angMU). 

35 
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Figure 17: The amino acid sequence which corresponds to or/3* (angMI). 
General Methods 

Escherichia coli XLl-Blue MR (Stratagene), E. coli DH10B (GibcoBRL) and E. coli 
ET12567 were grown in 2xTY medium as described by Sambrook et aL 9 (1989). Vector 
pUCl 8, pUC19 and Litmus 28 were obtained from New England Biolabs. E. coli 
transformants were selected with 100 |ig/ml ampicillin. Conditions used for growing the 
Saccharopolyspora erythraea NRRL 2338-red variant strain were as described previously 
(Gaisser etal 9 1997, Gaisser et al 9 1998). Expression vectors in S. erythraea were derived 
from plasmid pSG142 (Gaisser et al. 9 2000). Plasmid-containing S. erythraea were selected 
with 25-40 \ig/ml thiostrepton or 50 ng/ml apramycin. To investigate the production of 
antibiotics, S. erythraea strains were grown in sucrose-succinate medium (Caffrey et aL 9 
1992) as described previously (Gaisser et al 9 1997) and the cells were harvested by 
centrifugation. Chromosomal DNA of Streptomyces rochei ATCC21250 was isolated using 
standard procedures (Kieser et al. 9 2000). Feedings of 3-0-mycarosyI erythronolide B or 
tylactone were carried out at concentrations between 25 to 50 mg /l. 

DNA manipulation and sequencing 

DNA manipulations, PCR and electroporation procedures were carried out as 
described in Sambrook et al 9 (1989). Protoplast formation and transformation procedures of 
S. erythraea were as described previously (Gaisser et al. 9 1997). Southern hybridizations were 
carried out with probes labelled with digoxigenin using the DIG DNA labelling kit 
(Boehringer Mannheim). DNA sequencing was performed as described previously (Gaisser et 
aL 9 1997), using automated DNA sequencing on double stranded DNA templates with an ABI 
Prism 3700 DNA Analyzer. Sequence data were analysed using standard programs. 

Extraction and mass spectrometry 

1 ml of each fermentation broth was harvested and the pH was adjusted to pH 9. For 
extractions an equal volume of ethyl acetate, methanol or acetonitrile was added, mixed for at 
least 30 min and centrifuged. For extractions with ethyl acetate, the organic layer was 
evaporated to dryness and then re-dissolved in 0.5 ml methanol. For methanol and acetonitrile 
extractions, supernatant was collected after centrifugation and used for analysis. High 
resolution spectra were obtained on a Bruker BioApex II FT-ICR (Bruker, Bremen, FRG). 
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Analysis of culture broths 

An aliquot of whole broth (1 ml) was shaken with CH 3 CN (1 ml) for 30 minutes. The mixture 
was clarified by centrifugation and the supernatant analysed by LCMS. The HPLC system 
comprised an Agilent HP1 100 equipped with a Luna 5 Jim C18 BDS 4.6 x 250 mm column 
5 (Phenomenex, Macclesfield, UK) heated to 40°C. The gradient elution was from 25% mobile 
phase B to 75% mobile phase B over 19 minutes at a flow rate of 1 ml/min. Mobile phase A 
was 10% acetonitrile: 90% water, containing 10 mM ammonium acetate and 0.15% formic 
acid, mobile phase B was 90% acetonitrile: 10% water, containing 10 mM ammonium acetate 
and 0.15% formic acid. The HPLC system described was coupled to a Bruker Daltonics 
10 Esquire3000 electrospray mass spectrometer operating in positive ion mode. 

Extraction and purification protocol: 

For NMR analysis of 5 - O-dedesosaminy 1-5 - 0-mycaminosyl erythromycin A the 
fermentation broth was clarified by centrifugation to provide supernatant and cells. The 

15 supernatant was applied to a column (16x15 cm) of Diaion® HP20 resin (Supelco), washed 
with 10% Me 2 CO/H 2 0 (2x21) and then eluted with MeaCO (3.5 1). The cells were mixed to 
homogeneity with an equal volume of Me 2 CO/MeOH (1:1). After at least 30 minutes the 
slurry was clarified by centrifugation and the supernatant decanted. The pelleted cells were 
similarly extracted once more with MeaCO/MeOH (1:1). The cell extracts were combined 

20 with the MejCO from the HP20 column and the solvent was removed in vacuo to give an 

aqueous concentrate. The aqueous was extracted with EtOAc (3 x) and the solvent removed 
in vacuo to give a crude extract The residue was dissolved in CH 3 CN/MeOH and purified by 
repeated rounds of reverse phase (CI 8) high performance liquid chromatography using a 
Gilson HPLC, eluting a Phenomenex 21.2 x 250 mm Luna 5 pm CI 8 BDS column at 21 

25 ml/min. Elution with a linear gradient of 32.5% B to 63% B was used to concentrate the 

macrolides followed by isocratic elution with 30% B to resolve the individual erythromycins. 
Mobile phase A was 20 mM ammonium acetate and mobile phase B was acetonitrile. 
High resolution mass spectra were acquired on a Bruker BioApex II FTICR (Bruker, Bremen, 
Germany). 

30 

For NMR analysis of 5-Oangolosaminyl tylactone bioconversion experiments were 
performed as previously described with four 2 1 flasks containing each 400 ml of SSDM 
medium inoculated with 5% of pre-cultures. Feedings with tylactone were carried out at 50 
mg/l. The culture was centrifuged and the pH of the supernatant was adjusted to about pH 9 
35 followed by extractions with three equal volumes of ethyl acetate. The cell pellet was 
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extracted twice with equal volumes of a mixture of acetone-methanol (50:50, vol/vol). The 
extracts were combined and concentrated in vacuo. The resulting aqueous fraction was 
extracted three times with ethyl acetate and the extracts were combined and evaporated until 
dryness. This semi purified extract was dissolved in methanol and purified by preparative 
HPLC on a Gilson 315 system using a 21 mm x 250 mm Prodigy ODS3 column 
(Phenomenex, Macclesfield, UK). The mobile phase was pumped at a flow rate of 21 ml/min 
as a binary system consisting of 30% CH 3 CN, 70% H 2 0 increasing linearly to 70% CH 3 CN 
over 20 min. 



Sequence Information 

Table 7- Sequence information for the anmlo samine biosvnthetic genes included in the gene 
cassettes 



Gene (named according to tyl 
equivalent) 


Bases in Figure 


Corresponding polypeptide 
Figure number 


or/2 (angAI) 


14847-1573 lc from Figure 8 


Figure 10 

NDP-hexose synthase 


or/3 (angAII) 


13779-14774c from Figure 8 


Figure 1 1 

NDP-hexose 4,6-dehydratase 


or/4 

(N-part) 
(C-part) 


1 1306-13666c from Figure 8 


Figure 12 

typell thioesterase 

NDP-hexose 2,3-dehydratase 


or/74 


1 162-2 160c from Figure 7 


Figure 13 

NDP-hexose 4-ketoreductase 


or/15 (angB) 


33-1 151c from Figure 7 


Figure 14 

NDP-hexoseaminotransferase 


or/J* {angMUI) 


59800-61 140 from Figure 9 


Figure 15 

Hypothetical NDP hexose 3,4 
isomerase 


or/2* {angMH) 


61 159-62430 from Figure 9 


Figure 16 

angolosaminyl glycosyl 
transferase 


or/3* (angMT) 


62452-63171 from Figure 9 


Figure 17 

N,N-dimethyl transferase 



Note : c indicates that the gene is encoded by the complement DNA strand 
potential functions of the predicted polypeptides (SEQ ID No.7 to 34) were obtained 
from the NCBI database using a BLAST search. 
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Example 1: Byconversion of 3-0-mycarosyl erythronolide B to 5-0- 
dedesosaminyl-5-O-mycaminosyl erythromycins using gene cassette 
pSG144fylAItylAIItylMintylBtylIatylMIeryCIII. 

Isolation ofpSG143 

Plasmid pSG142 (Gaisser et al., 2000) was digested with Xbal and a fill-in reaction 
was performed using standard protocols. The DNA was re- ligated and used to transform E. 
colt DH10B. Construct pSG143 was isolated and the removal of thecal site was confirmed 
by sequence analysis. 

Isolation ofpUC18eryBVcas 

The gene eryBVwas amplified by PCR using the primers cas01eG21 (WO01/79520) 
and 7966 5'-GGGGAATTCAG^TCTGGTCJT^GAGGTCAGCCGGCGTGGCGGCGCGTG 
15 AGTTCCTCCAGTCGCGGGACGATCT -3' and pSG142 (Gaisser et ol., 2000) as template. 
The PCR fragment was cloned using standard procedures and plasmid pUClSeryBVcas was 
isolated with an Ndel site overlapping the start codon of eryBV and Xbal and BglR sites 
(underlined) following the stop codon. The construct was verified by sequence analysis. 

20 Isolation of vector pSGLitl 

The isolation of this vector is described in PCT/GB03/003230. 

Isolation ofpSGLitleryCIII 

Plasmid pSGCffl (WO01/79520) was digested with NdeVBglH. and the insert fragment was 
25 isolated and ligated with the NdeVBglB. treated vector fragment of pSGLitl. The ligation was 
used to transform E. coli ET12567 and plasmid pSGLitleryCIH was isolated using standard 
procedures. The construct was confinned using restriction digests and sequence analysis. This 
cloning strategy allows the introduction of a Ws-tag C-terminal of EryCHI. 

30 Isolation of pSGLitl tylMI 

Plasmid pSGTYLM2 (WO01/7952) was digested with NdeVBglU. and the insert fragment was 
isolated and ligated with the NdeVBgltl treated vector fragment of pSGLitl . The ligation was 
used to transform E. coli ET12567 and plasmid pSGLitl tylMI was isolated using standard 
procedures. The construct was confinned using restriction digests and sequence analysis. This 

35 cloning strategy allows the introduction of a his-tag C-terminal of TyUVffl. 
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Isolation ofpSG144 

Plasmid pSGLitl was isolated and digested with Ndel/BglR and an approximately 1.3 
kb insert was isolated. Plasmid pSG143 was digested with NdeVBglH, the vector band was 
isolated and ligated with the approximately 1.3 kb band from pSGLitl followed by 
transformation of E. coli DH10B. Plasmid pSG144 (F , ^) 2) was isolated and the construct 
was verified by DNA sequence analysis. This vector allows the assembly of gene cassettes 
directly in an expression vector (Figure 2) without prior assembly in pUC-derived vectors 
(WO 01/79520) in analogy to PCT/GB03/003230 using vector pSG144 instead of pSGsetl. 
Plasmid pSG144 differs from pSG142 in that the Xbal site between the thiostrepton resistance 
gene and the eryRHS has been deleted and the his- tag at the end of eryBV has been removed 
from pSG142 and replaced in pSG144 with an Xbal site at the end of eryBV. This is to 
facilitate direct cloning of genes to replace eryBV and then build up the cassette. 

Isolation ofpSGI44eryCHI 

EryCm was amplified by PCR reaction using standard protocols, with primers 
cas01eG21 (WO 01/79520) and caseryCHI2 (WO 01/79520) and plasmid pSGCHI (Gaisser et 
aL 9 2000) as template. The approximately 1 .3 kb PCR product was isolated and cloned into 
pUC18 using standard techniques. Plasmid pUCCHIcass was isolated and the sequence was 
verified. The insert fragment of plasmid pUCCHIcass was isolated after NdeVXbal digestion 
and ligated with the NdeVXbal digested vector fragment of pSG144. After the transformation 
of E. coli DH10B plasmid pSGUAeryCUI was isolated using standard techniques. 

Isolation ofpUC19tylAI 

Primers BIOSG34 5'- 
GGG CATATGA ACGACCGTCCCCGCCGCGCCATGAAGGG- 3' and 5'- 
CCCCTCTAGAGGTCACTGTGCCCGGCTGTCGGCGGCGGCCCCGCGC ATGG-3 ' were 
used with genomic DNA of Streptomyces fradiae as template to amplify tylAL The amplified 
product was cloned using standard protocols and plasmid p\JCl9 tylAI was isolated. The insert 
was verified by DNA sequence analysis. Differences to the published sequence are shown in 
Figure-3. 

Isolation of pSGLitl 

Plasmid Litmus 28 was digested with SpeVXbal and the vector fragment was isolated. 
Plasmid pSGLitl {dam) was digested with Xbal and the insert band was isolated and ligated 
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with the SpeUXbal digested vector fragment of Litmus 28 followed by the transformation of 
E. coli DH10B using standard techniques. Plasmid pSGLit2 was isolated and the construct 
was verified by restriction digest and sequence analysis. This plasmid can be used to add a 5' 
region containing an Xbal site sensitive to Dam raethylation and a Shine Dalgarno region thus 
5 converting genes which were originally cloned with an Ndel site overlapping the start codon 
and an Xbal site 3' of the stop codon for the assembly of gene cassettes. This conversion 
includes the transformation of the ligations into E. co//ET12S67 followed by the isolation of 
dam DNA and Xbal digests. Examples for this strategy are outlined below. 

1 0 Isolation ofpSGLit2tylAI 

Plasmid pSGLit2 and pUCl9tylAI were digested with Ndel I Xbal and the insert band 
of pUC19fy&4/and the vector band of pSGLit2 were isolated, ligated and used to transform E. 
coli ET12567. Plasmid pSGLitZtylAI (dam) was isolated. 

1 5 Isolation ofpUC19tylAII 
Primers 5' — 

CCCC TCTAGAG GTCATGCGCGCTCCAGTTCCCTGCCGCCCGGGGACCGCTTG- 3 ' 
and 5'- 

GG GTCTAGAT CGATTAATTAAGGAGGACATTCATGCGCGTCCTGGTGACCGGAGG 
20 TGCGGGCTTC ATCGGCTCGC ACTTC A- 3 9 and genomic DNA of Streptomyces fradiae as 
template were used for a PCR reaction applying standard protocols to amplify tylAIL The 
approximately 1 kb sized DNA fragment was isolated and cloned into SmaVzut pUC19 using 
standard techniques. The DNA sequencing of this construct revealed that 12 nucleotides at the 
5' end had been removed possibly by an exonuclease activity present in the PCR reaction. 
25 The comparison of the amino acid sequence of the cloned fragment compared to the published 
sequence is shown in Figure 4. 

Isolation ofpSGLit2tylAII 

To add the missing 5'-nucleotides, pSGLit2 was digested with PacVXbal and the 
30 vector fragment was isolated and ligated with the PacVXbal digested insert fragment of 
pUC19/y£4J7. The ligated DNA was used to transform E. coli ET12567 and plasmid 
pSGLit2tylAII(dam) was isolated. 
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Isolation qfplasmidpUC19eryCVI 

The eryCVI gene was amplified by PCR using primer BIOSG28 5'- 
GGGCATATGTACGAGGGCGGGTTCGCCGAGCTTTACGACC-3' and BIOSG29 5'- 
GGG GTCTAGAG GTCATCCGCGCACACCGACGAACAACCCG-3' and plasmid 
pNC062 (Gaisser et al, 1997) as a template. The PCR product was cloned into Smal digested 
pUC19 using standard techniques and plasmid pUC19eryCF/was isolated and verified by 
sequence analysis. 

Isolation of plasmid pSGLit2eryCVI 

Plasmid pUC19er>>CF7 was digested with NdeVXbal and ligated with the NdeVXbal 
digested vector fragment of pSGLit2 followed by transformation of E. coli ET12567. Plasmid 
pSGLit2eryCP7 (darn) was isolated. 

Isolation of plasmid pSG144tylAI 

Plasmid pSG144 and pUC190>MJ were digested with NdeVXbal and the insert band of 
p\5C\9tylAI m& the vector band of pSG144 were isolated, ligated and used to transform E. 
coli DH10B. Plasmid pSG144fyL4/was isolated using standard protocols. 

Isolation of plasmid pSG144tylAItylAII 

Plasmid pSGUHtylAII (dam) was digested v/iHiXbal and ligated with^al digested plasmid 
pSGl44tylAI. The ligation was used to transform E. coli DH10B and plasmid 
pSG\44tylAItylAIIv/as isolated and verified using standard protocols. 

Isolation of plasmid pSGLit2tylMIII 

Plasmid P UC18tylM3 (Isolation described in WOO 1/79520) was digested with NdeVXbal and 
the insert band and the vector band of NdeVXbal digested P SGLit2 were isolated, ligated and 
used to transform E. coli ET12567. Plasmid pSGUQ.tylMm (dam) was isolated using 
standard protocols. The construct was verified using restriction digests and sequence analysis. 

Isolation of plasmid pSGl 44tylAItylAHtylMIII 

Plasmid pSGUtltylMIII (dam) was digested with Xbal and the insert band was ligated with 
Xbal digested plasmid pSG\44tylAItylAII. The ligation was used to transform E. coli DH10B 
and plasmid pSGlUtylAItylAIItylMUI no36 was isolated using standard protocols. The 
construct was verified using restriction digests and sequence analysis. 
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Isolation of plasmid pSGLU2tylB 

Plasmid pUC18/yZB (Isolation described in WOO 1/79520) was digested with PacVXbal and 
the insert band and the vector band of PacVXbal digested pSGLit2 were isolated, ligated and 
used to transform E. coll ET12567. Plasmid pSGLit2/y/jB nol (dam) was isolated using 
5 standard protocols. 

Isolation of plasmid pSG144tylAItylAIItylMIIItylB 

Plasmid pSGLit2*y/2? (dam') was digested with Xbal and the insert band was ligated with 
Xbal digested plasmid pSGlAAtylAItylAUtylMUL The ligation was used to transform E. coli 
10 DH10B and plasmid pSGl44tylAItylAIItylAdIIItylB no5 was isolated using standard protocols 
and verified by restriction digests and sequence analysis. 

Isolation of plasmid pUC18tylIa 

Primers BIOSG 88 5 '-GG GCATATGG CGGCGAGC ACTACGACGGAGGGGAATGT-3 ' 
15 and BIOSG 89 5 > -GG GTCTAGAG GTCACGGGTGGCTCCTGCCGGCCCTCAG-3 9 were 
used to amplify tylla using a plasmid carrying the tyl region (accession number 
u08223.em_pro2) comprising ORF1 (cytochrome P450) to the end of ORF2 (TylB) as a 
template. Plasmid pUCtylla nol was isolated using standard procedures and the construct was 
verified using sequence analysis. 

20 

Isolation of plasmid pSGLit2tylIa 

Plasmid pUCtylla nol was digested with NdeVXbal and the insert band and the vector band 
of NdeVXbal digested pSGLit2 were isolated, ligated and used to transform E. coli ET12567. 
Plasmid pSGLit2fy//a no 54 (dam) was isolated using standard protocols. The construct was 
25 verified using sequence analysis. 

Isolation of plasmid pSG144tylAItylAIItylMUIty IB tylla 

Plasmid pSGLitZtylla (dam) was digested with Xbal and the insert band was ligated with 
Xbal digested plasmid pSGl44tylAItylAHtylMIIItylB. The ligation was used to transform E. 
30 coli DH10B and plasmid pSGYAAtylAItylAIItylMIIItylBtylla no3 was isolated using standard 
protocols and verified by restriction digests and sequence analysis. 

Isolation of plasmid pSGLUltylMeryCIU 

Plasmid pUCtylMI (Isolation described in WOO 1/79520) was Pad digested and the insert was 
3 5 ligated with the Pad digested vector fragment of pSGLit 1 eryCHI using standard procedures . 
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Plasmid pSGLitl^W47er);C/Z7no20 was isolated and the orientation was confirmed by 
restriction digests and sequence analysis. 

Isolation of gene cassette pSG144tylAItylAIItylMmtyWtylla(ylMIeryCIII 
Plasmid pSGUtltylMIeryCIU no20 was digested with XbaVBgia and the insert band was 
isolated and ligated with the XbaVBglH digested vector fragment of plasmid 
pSGltetylAItylAIItylAfflltylBtylla no3. Plasmid 

pSG144tylAItylAntylMmtylBtyllatylMeryCIIIvs?tt isolated using standard procedures and 
the construct was confirmed using restriction digests and sequence analysis. Plasmid 
preparations were used to transform SI erythraea mutant strains with standard procedures. 

Isolation of plasmid pSGKCl 

To prevent the conversion of the substrate 3-0-mycarosyl erythronolide B to 3,5-di-O- 
mycarosyl erythronolide B a further chromosomal mutation was introduced into S. erythraea 
SGQ2 (Isolation described in WO 01/79520) to prevent the biosynthesis of L-mycarose in the 
strain background. Plasmid pSGKCl was isolated by cloning the approximately 0.7 kb DNA 
fragment of the eryBVIgene by using PCR amplification with cosmid2 or plasmid pGGl 
(WOO 1/79520) as a template and with the primers 646 5'- 

CATCGTCAAGGAGTTCGACGGT- 3' and 874 5'-GCCAGCTCGGCGACGTCCATC- 
3' using standard protocols. Cosmid 2 containing the right hand site of the ery-cluster was 
isolated from an existing cosmid library (Gaisser et al, 1997) by screening with eryBVas a 
probe using standard techniques. The amplified DNA fragment was isolated and cloned into 
EcoRV digested pKCl 132 (Bierman et al, 1992) using standard methods. The ligated DNA 
was used to transform E. coli DH10B and plasmid pSGKCl was isolated using standard 
molecular biological techniques. The construct was verified by DNA sequence analysis. 

Isolation ofS, erythraea Q42/1 (Biot-2166) 

Plasmid pSGKCl was used to transform S. erythraea SGQ2 using standard techniques 
followed by selection with apramycin. Thiostrepton/apramycin resistant transformant S. 
erythraea Q42/1 was isolated. 

Byconversion using S. erythraea Q42/lpSG144tyLAItylAIItylhfflItyWtyllatylMIeryCm 
Byconversion assays using 3-O-mycarosyl erythronolide B are carried out as described in 
General Methods. Improved levels of mycaminosyl erythromycin A are detected in 
byconversion assays using £ erythraea 
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Q42l\pSG\44tylAItylAUtylMffltylBtyll atylMeryCm co to bioconversion levels 

previously observed (WOO 1/79520). 

Example 2: Isolation of mycaminosyl tylactone using gene cassette 
5 ^SGlUtylAItylAntylMIIItylBtyllatylMItylMn 

Isolation ofplasmidpSGLitltylMtylMII 

Plasmid pUCtylMl (Isolation described in WOO 1/79520) was Pacl digested and the insert was 
ligated with the Pad digested vector fragment of pSGLitltylMII using standard procedures. 
10 Plasmid pSGLitltylMItylMII nol6 was isolated and the construct was confirmed by 
restriction digests and sequence analysis. 

Isolation of plasmid pSG144tylAItyUIItylMmtylBtylIatyimtylMII 
Plasmid pSGlAtltylMItylMII no\6 was digested wiihXbaVBgni and the insert band was 
1 5 isolated and ligated with the Xbal/BglH digested vector fragment of plasmid 
pSG\44tylAItylAIItylMIIItylBtylIa no3. Plasmid 

pSG\44tylAItylAntylMintylBtyllatylMtylMnvtas isolated using standard procedures and the 
construct was confirmed using restriction digests and sequence analysis. The plasmid was 
isolated and used for transformation of S. erythraea mutant strains using standard protocols. 

20 

Bioconversion using gene cassette pSG144tylAItylMItylMmtylBtyllatylMItylMU 
The conversion of fed tylactone to mycaminosyl tylactone was assessed in bioconversion 
assays using S. erythraea Q42/lpSG\44tylAItylAntylMntylBtylIatyIMtylMH. 
Bioconversion assays were carried out using standard protocols (see Chemical Request sheet 

25 81). The analysis of the culture showed the major ion to be 568.8 [M+H] + consistent with the 
presence of mycaminosyl tylactone. Fragmentation of this ion gave a daughter ion of m/z 174, 
as expected for protonated mycaminose. No tylactone was detected during the analysis of the 
culture extracts, indicating that the bioconversion of the fed tylactone was complete. 
Recently, a homologue of Tylla was identified in the biosynthetic pathway of dTDP-3- 

3 0 acetamido-3 ,6-dideoxy-alpha-D-galactose in Aneurinibacillus thermoaerophilus L420-9 1 T * 
(Pfoestl et aL, 2003) and the function was postulated as a novel type of isomerase capable of 
synthesizing dTDP-6-deoxy-D-xy tohex-3 -ulose from dTDP-6-deoxy-D-xylohex-4-ulose. 
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Example 3: Bioconversion of 3-0-mycarosyl erythronolide B to 5-0-dedesosaminyl-i 
mycaminosyl erythromycins using gene cassette pSG1448/27/9S/21/44/193/6eryCIII 
(pSG144angAIangAIIoifl4angMniangBangMIeryCIII). 



Cloning ofangMffl by isolating plasmid Litl/4 

The gene angMfflv/as amplified by PCR using the primers BIOSG61 5'- 
GGG CATATGA GCCCCGCACCCGCCACCGAGGACCC -3' and BIOSG62 5'- 
GGTCTAGAGGTCAGTTCCGCGGTGCGGTGGCGGGCAGGTCAC -3'. Cosmid5B2 
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 1.4 
kb PCR fragment (PCR nol) was cloned using standard procedures and EcoRV digested 
plasmid Litmus28. Plasmid Litl/4 was isolated with an MM site overlapping the start codon 
of angMW and an Xbal site following the stop codon. The construct was verified by sequence 
analysis. 

Isolation of plasmid pSGLit21/4 

Plasmid Litl/4 was digested with NdeVXbal and the about 1 .4 kb fragment was isolated and 
iigated to NdeVXbal digested DNA of pSGLit2. The ligation was used to transform E. coli 
ET12567 and plasmid P SGLit2i/V no7 {dam) was isolated. This construct was digested with 
Xbal and used for the construction of gene cassettes. 

Cloning ofangMII by isolating plasmid Lit2/8 

The gene angMII was amplified by PCR using the primers BIOSG63 5'- 
GGGCATATGCGTATCCTGCTGACGTCGTTCGCGCACAACAC -3* and BIOSG64 5'- 
GGTCTAGAGGTCAGGCGCGGCGGTGCGCGGCGGTGAGGCGTTCG -3' and 
cosmid5B2 containing a fragment of the angolamycin biosynthetic pathway was used as 
template. The 1.3 kb PCR fragment (PCR no2) was cloned using standard procedures and 
EcoRV digested plasmid Litmus28. Plasmid L02/8 was isolated with an Ndel site overlapping 
the start codon of angMII and anXbal site following the stop codon. The construct was 
verified by sequence analysis. 

Cloningjof angMII by isolating plasmid phitangMEfBglll) 
The gene angMII was amplified by PCR using primers BIOSG63 5'- 
GG GCATATGC GTATCCTGCTGACGTCGTTCGCGCACAACAC -3' and BIOSG80 5'- 
GG AGATCTG GCGCGGCGGTGCGCGGCGGTGAGGCGTTCG -3' and cosmid5B2 
containing a fragment of the angolamycin biosynthetic pathway as template. The 1.3 kb PCR 
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fragment was cloned using standard procedures and EcoRV digested plasmid Litmus28. 
Plasmid LitangMU(BglII)noS was isolated with an Ndel site overlapping the start codon of 
angMIIand a BglH site instead of a stop codon thus allowing the addition of a to-tag. The 
construct was verified by sequence analysis. 

5 

Isolation of plasmid pSGLitlangMU 

Plasmid LitangMII(Bgin) was digested with NdeVBglR and ligated with the NdeVBgtil 
digested vector fragment of pSGLitl. The ligation was used to transform E. coli ET 12567 and 
plasmid pSGLitl angMU (darn) was isolated using standard procedures. 

10 

Cloning of angMI by isolating plasmid LH3/6 

The gene angMI was amplified by PCR using the primers BIOSG65 5- 
GGGGAT^TGAACCTCGAATACAGCGGCGACATCGCCCGGTTG -3' and BIOSG66 5 ! - 
GGTCTAGAGGTCAGGCCTGGACGCCGACGAAGAGTCCGCGGTCG -3' and 
15 cosmid5B2 containing a fragment of the angolamycin biosynthetic pathway was used as 

template. The 0.75 kb PCR fragment (PCR no3) was cloned using standard procedures and 
EcoRV digested plasmid Litmus28. Plasmid lS\\3/6 was isolated with an Ndel site overlapping 
the start codon of angMI and an Xbal site following the stop codon. The construct was 
verified by sequence analysis. 

20 

Isolation of plasmid pSGlit23/6 no8 

Plasmid \At3/6 was digested with NdeUXbal and the about 0.8 kb fragment was isolated and 
ligated to NdeVXbal digested DNA of pSGLit2. The ligation was used to transform E. coli 
ET12567 and plasmid pSGLit23/tf no8 {dean) was isolated. This construct was digested with 
25 Xbal and the isolated about 1 kb fragment was used for the assembly of gene cassettes. 

Cloning ofangB by isolating plasmid Lit4/ 19 

The gene angB was amplified by PCR using the primers BIOSG67 5'- 
GGGCAI^TOACTACCTauGTCt UUUAC 1 ACCTGGCGG -3 f and BIOSG68 5*- 

30 GGTCTAGAGGTCAGAGCGTGGCCAGTACCTCGTGCAGGGC -3' and cosmid4H2 

containing a fragment of the angolamycin biosynthetic pathway was used as template. The 1 .2 
kb PCR fragment (PCR no4) was cloned using standard procedures and EcoRV digested 
plasmid Litmus28. Plasmid IAU/19 was isolated with an Ndel site overlapping the start codon 
ofangB and an Xbal site following the stop codon. The construct was verified by sequence 

35 analysis. 
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Isolation of plasmid pSGlit24/ 19 

Plasmid UX4/19 was digested with NdeVXbal and the 1.2 kb fragment was isolated and 
ligated into NdeVXbal digested DNA of pSGLit2. The ligation was used to transform E. coli 
ET12567 and plasmid pSGLit24//P no24 {dam) was isolated. This construct was digested 
with Xbal and the isolated 1 .2 kb fragment was used for the assembly of gene cassettes. 

Cloning of or/14 by isolating plasmid LitS/2 

The gene orfl4 was amplified by PCR using the primers BIOSG69 5- 
GG GCATATGG TGAACGATCCGATGCCGCGCGGCAGTGGCAG-3' and BIOSG70 5 1 - 
GGTCIAGAGGTC AACCTCC AG AGTGTTTCG ATGGGGTGGTGGG-3 1 and cosmid4H2 
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 1.0 
kb PCR fragment (PCR no5) was cloned using standard procedures and EcoKW digested 
plasmid Litmus28. Plasmid Lit5/2 was isolated with an Ndel site overlapping the start codon 
of ORF14 and an Xbal site following the stop codon. The construct was verified by sequence 
analysis. 

Isolation of plasmid pSGlit25/2 no24 

Plasmid LitJ/2 was digested with NdeVXbal and the approximately 1 kb fragment was 
isolated and ligated to NdeVXbal digested DNA of P SGLzY2. The ligation was used to 
transform E. coli ET12567 and plasmid pSGLit25/2 no24 {dam) was isolated. This construct 
was digested with Xbal, the about 1 kb fragment isolated and used for the assembly of gene 
cassettes. 

Isolation of plasmid pSGlit27/9 no 15 

Plasmid Lit7/P was digested with NdeVXbal and the approximately 1 kb fragment was 
isolated and ligated to NdeVXbal digested DNA of pSGLit2. The ligation was used to 
transform E. coli ET12567 and plasmid pSGLit27/9 nol5 {dant) was isolated. This construct 
was digested withJHtol and the isolated 1 kb fragment was used for the assembly of gene 
cassettes. 

Cloning ofangAI (or/2) by isolating plasmid Lit8/2 
The gene angAIv/as amplified by PCR using the primers BIOSG73 5 - 
GGGCATATG AAGGGC ATC ATCCTGGCGG.GCGGC AGCGGC-3 1 and BIOSG74 5*- 
GGTCTAGAGGTCATGCGGCCGGTCCGGACATGAGGGTCTCCGCCAC-3 1 and 
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cosmid4H2 containing a fragment of the angolamycin biosynthetic pathway was used as 
template. The around 1.0 kb PCR fragment (PCR no8) was cloned using standard procedures 
and EcoKV digested plasmid Litmus28. Plasmid LitS/2 was isolated with an Ndel site 
overlapping the start codon of angAI and an Xbal site following the stop codon. The construct 
5 was verified by sequence analysis. 

Cloning ofangAH (or/3) by isolating plasmid LU7/9 
The gene angAIIv/as amplified by PCR using the primers BIOSG71 5 1 - 
GG GCATATG CGGCTGCTGGTCACCGGAGGTGCGGGC-3 t and BIOSG72 5*- 
10 GGTCTAGAGGTC AGTCGGTGCGCCGGGCCTCCTGCG-3 1 and cosmid4H2 containing a 
fragment of the angolamycin biosynthetic pathway was used as template. The 1 .0 kb PCR 
fragment was cloned using standard procedures and EcoKV digested plasmid Litmus28. 
Plasmid Lit7/P was isolated with an Ndel site overlapping the start codon of angAII and an 
Xbal site following the stop codon. The construct was verified by sequence analysis. 

15 

Isolation of plasmid pSGlit28/2 no 18 (pSGLU2angAI) 

Plasmid LitS/2 was digested with NdeVXbal and the 1 kb fragment was isolated and ligated to 
NdeVXbal digested DNA of pSGLit2. The ligation was used to transform E. coli ET12567 
and plasmid pSGLit2S/2 no 18 (dam) was isolated. 

20 

Isolation of plasmid pSGl 448/2 (pSG144angAI) 

Plasmid LitS/2 was digested with NdeVXbal and the approximately 1 kb fragment was 
isolated and ligated with NdeVXbal digested DNA of pSG144. The ligation was used to 
transform E. coli DH10B and plasmid pSG144S/2 (dam ) (pSG!44angAI) was isolated using 
25 standard procedures. This construct was verified with restriction digests and sequence 
analysis. 

Isolation of plasmid pSGl 448/27/9 (pSG144angATangAU) 

Plasmid pSGLit2 7/9 (isolated from E.coli ET12567) was digested with^&al and the 1 kb 
30 fragment was isolated and ligated with the Xbal digested vector fragment of pSG 1445/2 
(pSG144ong47). The ligation was used to transform E. coli DH10B and plasmid 
pSG1445/27/P (pSG 1 AAangAIangAII) was isolated using standard protocols. The construct 
was verified with restriction digests and sequence analysis. 

35 Isolation of plasmid pSG 1448/27/9 1/4 (pSG144cmgAIangAIIangMIU) 
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Plasmid pSGLit2i/4 (isolated from E. coli ET12567) was digested with Xbal and the 1.4 kb 
fragment was isolated and ligated with the Xbal digested vector fragment of pSGl 448/2 7/9 
(pSG144angAIangAII). The ligation was used to transform E. coli DH10B and plasmid 
pSG\448/27/91/4 (pSG\44angAIangAHangMIII) was isolated using standard protocols. The 
5 construct was verified with restriction digests and sequence analysis. 

Isolation of plasmid pSG 1448/27/9 1/44/19 (pSG144angAIangAIIangMIIIangB) 
Plasmid pSGLit24//P (isolated from E. coli ET12567) was digested with Xbal and the about 
1 .2 kb fragment was isolated and ligated with the Xbal digested vector fragment of 
10 pSGl448/27/91/4 (pSG 1 44angAIangAIIangMIII). The ligation was used to transform E. coli 
DH10B and plasmid pSGl 445/2 7/91/44/19 (pSGl44cmgAIangAIIcmgMIIIangB) was isolated 
using standard protocols. The construct was verified with restriction digests and sequence 
analysis. 

1 5 Isolation of plasmid pSGl 448/27/91/44/1 93/6 (pSG144angAIangAnangMIIIangBangMI) 
Plasmid pSGLit23/6 (isolated from E. coli ET12567) was digested with Xbal and the about 
0.8 kb fragment was isolated and ligated with the Xbal digested vector fragment of 
pSGl448/27/9 1/44/1 9 (pSGl44angAIangAIIangMIIIangB). The ligation was used to 
transform E. coli DH10B and plasmid pSG144S/27/P 1/44/1 93/6 

20 (pSG 1 44angAIangAIIangMIIIangBangMI) was isolated using standard protocols. The 
construct was verified with restriction digests and sequence analysis. 

Isolation of plasmid pSGl 448/27/91/44/1 93/6eryCIH 
(pSG144cmgAIangAIIangMIIIangBangMIeryCIII) 
25 Plasmid pSGUtleryCIII (isolated from E. coli ET12567) was digested with XbaVBglD. and 
the about 1.2 kb fragment was isolated and ligated with the Xbal digested and partially BglU. 
digested vector fragment of pSG144S/27/9//^//93/5 

(pSGl44angAIangAIIangMIIIangBangMI). The Bglll partial digest was necessary due to the 
presence of a BglH site in angB. The ligation was used to transform E. coli DH10B and 
30 plasmid pSGl448/27/91/44/193/6eryCIH no9 

XpSGl44angAIcmgAIIangMIIIangBangMeryCIII) was isolated using standard protocols. The 
construct was verified with restriction digests and sequence analysis. EryCEEI carries a to-tag 
fusion at the end. 
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Bioconversion of 3-O-mycarosyl erythronolide B to 5-0-dedesosaminyl-5-0-mycaminosyl 
erythromycin A using S. erythraea 2^2/ipSG1448/27/91/44/193/6eryCZff no9 
(pSG\44angAIcmgAIIangMUIangBangMIeryCIII) 

The S. erythraea strain Q42/lpSG144S/27/9 1/44/1 93/6eryCUI v/as grown and bioconversions 
5 with fed 3-O-mycarosyl erythronolide B were performed as described in the General 

Methods. The cultures were analysed and a small amount of a compound with m/z 750 was 
detected consistent with the presence of 5-0-dedesosaminyl-5-O-mycaminosyl erythromycin 
A. 

10 Isolation ofplasmid pSGl 448/27/95/2 (pSG144angAIangAIIorfl4) 

Plasmid pSGLit2J/2 (isolated from E. coli ET12567) was digested with^ial and the about 1 
kb fragment was isolated and ligated with the^M digested vector fragment of pSG144S/2 7/9 
(pSGl44angAIangAII). The ligation was used to transform E. coli DH10B and plasmid 
pSG144S/2 7/95/2 (pSG\44angAIangAHorfl4) was isolated using standard protocols. The 

1 5 construct was verified with restriction digests and sequence analysis. 

Isolation of plasmid pSGl 448/27/95/2 1/4 (pSG144angAIangAIIorfl4angMIII) 
Plasmid pSGLit27/4 (isolated from E. coli ET12567) was digested with Xbal and the 1 .4 kb 
fragment was isolated and ligated with the Xbal digested vector fragment of 
20 pSG144S/27/P5/2 (pSGl44angAIangAIIorfl4). The ligation was used to transform E. coli 

DH10B and plasmid pSG 1445/2 7/95/21/4 (pSG 1 44angAIangAUorfl 4angMHI) was isolated 
using standard protocols. The construct was verified with restriction digests and sequence 
analysis. 

25 Isolation ofplasmid pSG 1448/27/95/2 1/44/19 (pSG144angAIangAJIorfl4angMIIIangB) 

Plasmid pSGLit247P (isolated from E. coli ET12567) was digested with Xbal and the L2 kb 
fragment was isolated and ligated with the Xbal digested vector fragment of 
pSG144S/27/PJ/2//4 (pSG\44angAIangAUorfl4angA4III). The ligation was used to transform 
E. coli DH10B and plasmid pSG\448/27/95/2 1/44/1 9 

30 (pSG144angAIangAIIorfl4angMIIIangB) was isolated using standard protocols. The 
construct was verified with restriction digests and sequence analysis. 

Isolation ofplasmid pSGl 448/27/95/2 1/44/1 93/6eryCIII 
(pSG144cmgAIangAIIoffl4cmgMUcmgBangMIeryCni) 
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Plasmid pSGl448/27/91/44/193/6eryCm no9 was digested with BglH and the about 2 kb 
fragment was isolated and ligated with the BglB. digested vector fragment of 
pSG\448/27/95/21/44/19 (pSG144angAIangAIIorfl4angMniemgB). The ligation was used to 
transform E. co/i'DHlOB and plasmid pSG144S/2 7/95/2 1/44/1 93/6eryCIII 
5 (pSG\44cmgAIangAIIorfl4angMinangBangMetyCIII) was isolated using standard 

protocols. The construct was verified with restriction digests and sequence analysis. 

EryCm carries a to-tag fusion at the end. The construct was used to transform S. erythraea 

SGQ2 using standard procedures. 

10 Byconversion of 3-O-mycarosyl erythronolide B to S-O-dedesosaminylS-O-mycaminosyl 
erythromycin A 

The S. erythraea strain SGQ2pSG144S/2 7/95/2 1/44/1 93/6eryCHI vras grown and 
bioconversions with fed 3-O-mycarosyl erythronolide B were performed as described in the 
General Methods. The cultures were analysed and improved amounts of a compound with m/z 
1 5 750 was detected consistent with the presence of 5-0-dedesosaminyl-5-0-mycaminosyl 

erythromycin A. Similar results were obtained with the S. erythraea strain Q42/1 containing 
the gene cassette pSGU48/27/95/21/44/193/6eryCIII. 

16 mg of the compound with m/z 750 was purified and the structure of 5-0-dedesosaminyl-5- 
O-mycaminosyl erythromycin A was confirmed by NMR analysis (See Table I and Figure 1). 
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Table II: 'Hrmd 13 C NMR data for 5-O-dedesosaminvl-^-n-mvcammosvl erythromycin A 





Position 


Sh 


Multiplicity 


1 






2 


2.83 


dq 


3 


3.91 


dd 


4 


2.00 


m 


5 


3.53 


d 


6 






7 


1.66 


dd 




1.82 


dd 


8 


2.69 


dqd 


9 






lO 


3.06 


qd 


11 


3.81 


d 


12 






13 


5.04 


dd 


14 


1.47 


dqd 




1.91 


ddq 


15 


0.83 


dd 


16 


1.18 


d 


17 


1.03 


d 


18 


1.44 


s 



Coupling S c 

T75X 



9.6, 7.1 44.9 
9.7, 1.6 80.0 

39.1 

6.8 85.4 

74.8 

14.8, 2.2 38.5 
14.8, 11.4 

11.3,7.0,2.2 44.9 
221.6 

6.9, i-3 38.0 

1.3 68.9 

74.6 

11.0,2.3 76.8 a 
14.3,11.0,7.2 21.1 
14.3, 7.5, 2.2 

7.4, 7.4 10.6 
7.1 16-0 

7.4 9.7 

26.6 



31 



Position 


5 H 


Multiplicity 


Coupling 


5c 


19 


1.16 


d 


7.0 


18.3 


20 


1.14 


d 


7.0 


12.0 


21 


1.12 


s 




16.2 


r 


4.87 


d 


4.8 


96.4 


2' 


1.55 


dd 


15.2, 4.8 


34.9 




2.32. 


dd 


15.2, 0.9 




3' 








72.8 


4' 


3.01 


d 


9.3 


77.8 


5' 


3.99 : 


dq 


9.3, 6.2 


65.6 


6' 


1.27' 


d 


6.2 


18.5 


7' 


1.23 


s 




21.4 


8' 


3.29 


s 




49.4 


1" 


4.43 


d 


7.4 


103.3 


2" 


3.56 


dd 


10.5, 7.3 


71.3 


3" 


2.48 


dd 


10.3, 10.3 


70.6 


4" 


3.09 


dd 


9.9, 9.0 


70.2 


5" 


3.31 


dq 


9.0, 6.1 


72.9 


6" 


1.29 


d 


6.1 


18.1 


7" 


2.58 


s 




41.7 



3 This carbon was assigned from the HMQC spectrum 



Example 4: Isolation of mycaminosyl tylactone 

Isolation ofplasmidpSG1448/27/95/21/44/193/6tylMII 
5 (pSG144angAIcmgAIIorfl4cmgMncmgB3/6tylMII) 

Plasmid pSGl448/27/91/44/193/6iylMI no9 was digested with BglH and the about 2 kb 
fragment was isolated and ligated with the BglR digested vector fragment of 
pSG144S/2 7/95/2 1/44/1 9 (pSGl44angAIangAIIorfl4angMIIZangB). The ligation was used to 
transform E. coli DH10B and plasmid pSGl448/27/95/2 1/44/1 93/6tylMn 
1 0 (pSG 1 AAangAIangAIIorfl4cmgMIIIangBangMItylMII) was isolated using standard protocols. 
The construct was verified with restriction digests and sequence analysis. 
TylMII carries a /ws-tag fusion at the end. 

Bioconversion of tylactone to mycaminosyl tylactone 
15 The S. erythraea strain QA2l\pSG\AA8/27/95/21/44/193/6tylMII is grown and byconversions 
with fed tylactone is performed as described in the General Methods. The cultures are 
analysed and a compound with m/z 568 is detected consistent with the presence of 
mycaminosyl tylactone. 
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Example 5: Isolation of 5-0-dedesosaminyl-5-0-angolosaminyl erythromycins using 
gene cassette v SG1448/27/9I/4spn05/2p4/I93/6tylMHby byconversion of 3-0-mycarosyl 
erythronolide B. 

Isolation of plasmid com nol 

For the multiple use of promoter sequences in ac/-controlled gene cassettes a 240 bp fragment 
was amplified by PCR using the primers BIOSG78 5'- 

GGG CATATGT GTCCTCCTTAATTAATCGATGCGTTCGTCC-3' and BIOSG79 5'- 
GG AGATCTG GTCTAGATCGTGTTCCCCTCCCTGCCTCGTGGTCCCTCACGC -3' and 
plasmid pSG142 (Gaisser et ah, 2000) as template. The 0.2 kb PCR fragment (PCR no5) was 
cloned using standard procedures and £coRV digested plasmid Litmus28. Plasmid conv nol 
was isolated. The construct was verified by sequence analysis. 

Isolation of pSGLit3religl 

Plasmid conv nol was digested with NdeVBglB. and the about 0.2 kb fragment was isolated 
and ligated with the BamHl/Ndel digested vector fragment of pSGLit2. The ligation was used 
to transform E. coli DH10B and plasmid P SGLit3religl was isolated using standard 
procedures. This construct was verified using restriction digests and sequence analysis. 

Isolation of plasmid pSGlit34/l 9 

Plasmid Ut4/19 was digested with NdeVXbal and the 1.2 kb fragment was isolated and 
ligated to NdeVXbal digested DNA of pSGLit3. The ligation was used to transform E. coli 
ET12567 and plasmid pSGLit34//9 no23 was isolated. This construct was digested with Xbal 
and the isolated 1 .4 kb fragment was used for the assembly of gene cassettes. 

Cloning oforft by isolating plasmid LU6/4 

The gene orft was amplified by PCR using the primers BIOSG75 5'- 
GGGCATATGAGC ACCCCTTCCGC ACC ACCCGTTCCG-3 ' and BIOSG76 5'- 
G GTCTAGAG GTCAGTACAGCGTGTGGGCACACGCCACCAG-3' and cosmid4H2 
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 2.1 
kb PCR fragment (PCR no6) was cloned using standard procedures and EcoKV digested 
plasmid Litmus28. Plasmid Lit6/4 was isolated with an Ndel site overlapping the start codon 
of orf4 and an Xbal site following the stop codon. The construct was verified by sequence 
analysis. 
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Isolation of plasmid pSGlit26/4 no9 

Plasmid IAX6/4 was digested with NdeVXbal and the DNA was isolated and ligated to 
NdeUXbal digested DNA of pSGLit2. The ligation was used to transform E. coli ET12567 
and plasmid pSGLit2d/4 no9 was isolated. This construct was confirmed by restriction digests 
5 and sequence analysis. 

Cloning of spnO by isolating plasmid pUC19spnO 

The gene spnO from the spinosyn biosynthetic gene cluster of Saccharopolyspora spinosa 
was amplified by PCR using the primers BIOSG41 5- 

1 0 GG GCATATGA GCAGTTCTGTCGAAGCTGAGGCAAGTG-3 1 and BIOSG42 5'- 

GGTCTAGAGGTC ATCGCCCC AACGCCC AC AAGCTATGC AGG-3 1 and genomic DNA 
of S. spinosa as template. The about 1.5 kb PCR fragment was cloned using standard 
procedures and Smal digested plasmid pUC19. Plasmid \>\5C\9spnO no2 was isolated with an 
Ndel site overlapping the start codon of spnO and an Xbal site following the stop codon. The 

1 5 construct was verified by sequence analysis. 

Isolation of plasmid pSGlit2spnO no4 

Plasmid pUC\9spnO was digested with NdeVXbal and the 1.5 kb fragment was isolated and 
ligated to NdeVXbal digested DNA of pSGLit2. The ligation was used to transform E. coli 
20 ET 12567 and plasmid pSGLitlspnO no 4 was isolated using standard procedures. This 

construct was digested with Xbal and the isolated 1 .5 kb fragment was used for the assembly 
of gene cassettes. 

Isolation of plasmid pSG 1448/27/9 l/4spnO (pSG144angAIangAIIangMIIIspnO) 
25 Plasmid pSGLitZspwO no4 (isolated from E. coli ET12567) was digested with Xbal and the 
1 .5 kb fragment was isolated and ligated with the Xbal digested vector fragment of 
pSG144S/27/97/4 (pSGl44angAIangAIIangMUI). The ligation was used to transform E. coli 
DH10B and plasmid pSGl448/27/91/4spnO ($SG\44angAIangAIIangMIIIspnO) was isolated 
using standard protocols. The construct was verified with restriction digests and sequence 
30 analysis. 

Isolation of plasmid pSG 1448/27/9 l/4spn05/2 (pSG144angAIangAIIangMIIIspnOangorfl4J 
Plasmid pSGLit2J/2 no24 (isolated from E. coli ET12567) was digested with Xbal and the 1 
kb fragment was isolated and ligated with the Xbal digested vector fragment of 
35 pSGl 448/2 7/91/4spnO (pSGl44angAIangAHangMUIspnO). The ligation was used to 
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transform R coli DH10B and plasmid pSG\448/27/91/4spnOS/2 

(pSG\44angAIangAIIangMIIIspnOangorfl4) was isolated using standard protocols. The 
construct was verified with restriction digests and sequence analysis. 

Isolation of plasmid pSGl 448/27/9 l/4spn05/2p4/19 
(pSG144angAIangAIIangMIIIspnOangorfl 4pangB) 

Plasmid pSGLit34//P no23 (isolated from E. coli ET12567) was digested with Xbal and the 
about 1.4 kb fragment was isolated and ligated with the Xbal digested vector fragment of 
pSGl448/27/91/4spnOS/2 (pSGl44angAJangAIIangMIIIspnOangorfl4). The ligation was 
used to transform E. coli DH10B and plasmid pSG\448/27/91/4spn05/2p4/19 
{pSGl44angAIangAIIangMIIIspnOang07fl4pangB) was isolated using standard protocols. 
The construct was verified with restriction digests and sequence analysis, 'p' indicates the 
presence of the promoter region in front of angB to emphasize the presence of multiple 
promoter sites in the construct. 

Isolation of plasmid pSG 1 448/27/9 l/4spnOS/2p4/l 9 3/6eryCUI 
(pSGI44angAIangAIIangAfflIspnOorfl4pangBangMeryCIII) 

Plasmid pSG144S/27/P 1/44/1 93/6eryCIU no9 was digested with BgDl and the about 2 kb 
fragment was isolated and ligated with the Bglil digested vector fragment of 
pSG\448/27/91/4spnQ5/2p4/19 (pSGl44angAIangAIIangMIIIspnOorfl4pangB). The 
ligation was used to transform E. coli DH10B and plasmid 
pSG\448/27/91/4spn05/2p4/193/6eryCm 

(pSG144angAIangAIIangMHIspnOorfl4pangBangMIeryCIII) was isolated using standard 
protocols. The construct was verified with restriction digests and sequence analysis. EryCDI 
carries a /ris-tag fusion at the end. 'p' indicates the presence of the promoter region in front of 
angB to emphasize the presence of multiple promoter sites in the construct. The plasmid 
construct was used to transform mutant strains of S. erythraea using standard procedures. 

Bioconversion of 3-O-mycarosyl erythronolide B to S-O-dedesoscmiinyl-5-O-angolosaminyl 
erythromycins 

Strain S. erythraea Q42/\pSGl448/27/91/4spnOS/2p4/193/6eryCIII'was grown and 
byconversions with fed 3-O-mycarosyl erythronolide B were performed as described in the 
General Methods. The cultures were analysed and peaks with m/z 704, m/z 718 and m/z 734 
consistent with the presence of angolosaminyl erythromycin D, B and A, respectively, were 
observed. 
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Example 6: Production of 5-0-angolosaminyl tylactone 

Isolation ofplasmid P SG1448/27/91/4spn05/2p4/193/6tylMn 
(pSG144cmgAIangAIIcmgMIIkpnOoTfl4pcmgBangM^ 

Plasmid pSG1448/27/91/44/193/60>/A4Z7no9 was digested with BglR and the about 2 kb 
fragment was isolated and ligated with the BgttL digested vector fragment of 
pSGl448/27/91/4spn05/2p4/19 {^SGUAangAIcmgAUangMIIIspnOorfl4pangB). The 
ligation was used to transform E. coli DH10B and plasmid 
pSG\448/27/91/4spn05/2p4/193/6tyIMII 

(pSGJ44angAIcmgAIIangMIJIspnOorfl4pangBangMItylAiL0 was isolated using standard 
protocols. The construct was verified with restriction digests and sequence analysis. TylMH 
carries a /i/s-tag fusion at the end. The plasmid was used to transform mutant strains of & 
erythraea applying standard protocols. c p 5 indicates the presence of the promoter region in 
front of angB to emphasize the presence of multiple promoter sites in the construct. 

Isolation ofS. erythraea 18Al(BIOT-2634) 

To introduce a deletion comprising the PKS and majority of post PKS genes in S. erythraea a 
region of the left hand side of the ery- cluster (LHS) containing a portion of eryCI 9 the 
complete ermE gene and a fragment of the eryBI gene were cloned together with a region of 
the right hand side of the ery- cluster (RHS) containing a portion of the eryBVIIgene, the 
complete eryKgene and a fragment of DNA adjacent to eryfL This construct should enable 
homologous recombination into the genome in both LHS and RHS regions resulting in the 
isolation of a strain containing a deletion between these two regions of DNA. The LHS 
fragment (2201 bp) was PCR amplified using & erythraea chromosomal DNA as template 
and primers BIdelNde (5'-CCCAl^IGACCGGAGTTCGAGGTACGCGGCTTG-3 9 ) and 
BIdelSpe (5'-GATACTAGTCCGCCGACCGCACGTCGCTGAGCC-3 '). Primer BIdelNde 
contains an Ndel restriction site (underlined) and primer BIdelSpe contains a Spel restriction 
site used for subsequent cloning steps. The PCR product was cloned into the Smal restriction 
site of pUC19, and plasmid pLSB177 was isolated using standard procedures. The construct 
was confirmed by sequence analysis. Similarly, RHS (2158 bp) was amplified by PCR using 
S. erythraea chromosomal DNA as template and primers BVHdelSpe (5'- 
TGCACTAGTGGCCGGGCGCTCGACGTCATCGTCGACAT-3 *) and BVHdelEco (5'- 
TCGATATCGTGTCCTGCGGTTTCACCTGCAACGCTG-3 '). Primer BVHdelSpe contains 
a Spel restriction site and primer BVHdelEco contains an EcoKV restriction site. The PCR 
product was cloned into the Smal restriction site of pUC19 in the orientation with Spel 
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positioned adjacent to Kpnl and EcdKV positioned adjacent to Xbal. The plasmid pLSB 178 
was isolated and confirmed using sequence analysis. Plasmid pLSB177 was digested with 
Ndel and Spel, the ~2.2kb fragment was isolated and similarly plasmid pLSB178 was 
digested with Ndel and Spel and the -4.6 kb fragment was isolated using standard methg^s. 
Both fragments were ligated and plasmid pLSB188 containing LHS and RHS combined 
together at a Spel site in pUC19 was isolated using standard protocols. An NdeVXbal 
fragment (-4.4 kbp) from pLSB188 was isolated and ligated with Spel and Ndel treated 
pCJR24. The ligation was used to transform E. coli DH10B and plasmid pLSB189 was' 
isolated using standard methods. Plasmid pLSB189 was used to transform S. erythraea P2338 
and transformants were selected using thiostrepton. S. erythraea Del 18 was isolated and 
inoculated into 6 ml TSB medium and grown for 2 days. A 5% inoculum was used to 
subculture this strain 3 times. 100 |il of the final culture were used to plate onto R2T20 agar 
followed by an incubation at 30°C to allow sporulation. Spores were harvested, filtered, 
diluted and plated onto R2T20 agar using standard procedures. Colonies were replica plated 
onto R2T20 plates with and without addition of thiostrepton. Colonies that could no longer 
grow on thiostrepton were selected and further grown in TSB medium. S. erythraea 18A1 
was isolated and confirmed using PCR and Southern blot analysis. The strain was designated 
LB-1 /BIOT-2634. For further analysis, the production of erythromycin was assessed as 
described in General Methods and the lack of erythromycin production was confirmed. In 
byconversion assays this strain did not further process fed erythronolide B and erythromycin 
D was hydroxy lated at C12 to give erythromycin C as expected, indicating that EryK was still 
functional. 

Bioconversion of tylactone toS-O- angolosaminyl tylactone 

Strain S. erythraea SGQ2pSGU48/27/91/4spnOS/2p4/J93/6tylA4IIv/as grown and 
byconversions with fed tylactone were performed as described in the General Methods. The 
cultures were extracted and analysed. A compound consistent with the presence of 
angolosaminyl tylactone was detected. 20 mg of this compound were purified and the 
structure was confirmed by NMR analysis. A compound consistent with the presence of 
angolosaminyl tylactone was also obtained when the gene cassette 

pSGU4S/J7/91/4spn05/2p4/193/6tylJmvfas expressed in the S. erythraea strain Q42/1 or S. 
erythraea 18A1. 

Table III: NMR data for 5-Q- 0D angolosaminyl Tvlactone 
# 5^ 8 H (mult., Hz) COSY H-H HMBC H-C ~ 
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1 

1 


174.4 








2 


39.8 


1.91 d (16.8) 
2.46 dd(16.8, 10.5) 


2b 
2a, 3 


1,3 
1 


3 


66.9 


3.68 dd(10.5, 1.2) 


2b 


1 


4 


40.4 


1.56 m 


5, 18 


3 


5 


80.7 


3.76 d (10.3) 


4 


4, 7, 18, 19, r 


6 


38.7 


2.68 m 


7b 




7 


33.6 


1.45 m 
1.55 m 


6 




8 


45.0 


2.70 m 


21 




9 


203.9 


- 






10 


118.3 


6.26 d (15.5) 


11 


12 


11 


147.7 


7.27 d (15.5) 


10 


9, 12, 13, 22 


12 


133.5 








13 


145.4 


5.60 d (10.4) 


14, 22 


11, 14, 22, 23 


14 


38.3 


2.70 m 


13, 15, 23 


12, 13, 15,23 


15 


78.8 


4.68 td (9.7, 2.4) 


14, 16b 


1, 17 


16 


24.7 


1.55 m 


15, 16b, 17 


15 


1.82 ddd 


16a, 17 


18 


17 


9.6 


0.91 1(7.2) 


16 


15, 16 


18 


9.7 


0.91 d (7.2) 


4 


3, 4, 5 


19 


21.0 


1.55 m 


20 




20 


11.8 


0.83 t (7.2) 


19 


6, 19 


21 


17.1 


1.15 d(6.8) 


8 


7,9 


22 


13.0 


1.76 s 


13 


11, 12, 13 


23 


16.1 


1.05 d(6.5) 


14 


13, 14, 15 


1 


101.0 


4.41 d(8.6) 


2* 


2' 


2' 


28.0 


1 AO . i i 

1.4s m 


1 , 2b , 3 


1,3,4 


2.05 ddd (10.4, 3.9, 1.6) 


2a', 3' 


l',3' 


3' 


65.8 


2.89 td (10.0, 3.9) 


2a', 2b', 4' 


4' 


4' 


70.5 


3.16 dd (9.5, 9.0) 


3', 5' 


3', 5', 6' 


5' 


73.2 


3.26 dq (9.6, 6.0) 


4', 6' 




6' 


17.7 


1.3 d (6.0) 


5' 





Isolation of plasmid pSGl 448/27/9 !/4spnOp5/2 
(pSGl 44 an gAIangAIIangMffispn Opango rfi 4) 

Plasmid pSGLit3 5/2 (isolated from E. coli ET12567) was digested with^6al and the insert 
fragment was isolated and ligated with thecal digested vector fragment of 
pSG\448/27/91/4spnO (pSG\44cmgAJangAnangMIIIspnO). The ligation was used to 
transform E. coli DH10B and plasmid pSGl 445/2 7/91/4spnOp5/2 
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(pSGl44angAIangAIIangMIIIspnOpangorfl4) was isolated using standard protocols. The 
construct was verified with restriction digests and sequence analysis. 



Isolation ofplasmidpSG1448/27/91/4spnOp5/24/19 

5 (pSG144angAIangAII<mgMIIIspnOpcmgOTfl 4angB) 

Plasmid pSGLit2¥/7P (isolated from £ coli ET12567) was digested with Xbal and the insert 
fragment was isolated and ligated with the Xbal digested vector fragment of 
pSG\448/27/91/4spnOp5/2 (pSG\44cmgAIcmgAIIcmgMmspnOpangorfl4). The ligation was 
used to transform E. coli DH10B and plasmid pSG\448/27/91/4spnOp5/24/19 

10 {pSGl44cmgAIcmgAIIangMIIIspnOpangorfl4angB) was isolated using standard protocols. 
The construct was verified with restriction digests and sequence analysis. 

Isolation of plasmid pSGI448/27/9I/4spnOp5/24/1 93/6 
(pSG144angAIangAIIcmgAfIIIspnOpangOTfl4angBangMI) 

1 5 Plasmid pSGLit23/tf (isolated from E. coli ET12567) was digested with Xbal and the insert 
fragment was isolated and ligated with thecal digested vector fragment of 
pSGl448/27/91/4spnOp5/24/19 (pSGl44angAIangAIIangAdIIIspnOpango?fl4angB). The 
ligation was used to transform E. coli DH10B and plasmid pSG\448/27/91/4spnOp5/24/193/6 
(pSG\44cmgAIangAnangAdIIIspnOpcmgorfl4angBangMI) was isolated using standard 

20 protocols. The construct was verified with restriction digests and sequence analysis. 

Isolation of plasmid pSGl 448/27/9 l/4spnOp5/24/193/6angMII 
(pSG144angAIangAIIangMIIspnOpangorfl4angBcmgMcmgMII) 

Plasmid pSGLitlowgAGZ (isolated from E. coli ET12567) was digested with XbaVBglil and 
25 the insert fragment was isolated and ligated with the Xbal and partial BgKl digested vector 
fragment of pSG 1445/2 7/91/4spnOp5/24/193/6 

(pSGl44angAIangAIIcmgAfflIspnOpcmgorfl4angBangMI). The ligation was used to 
transform E. coli DH10B and plasmid pSG\448/27/91/4spnOp5/24/193/6angMII 
(pSG 1 44cmgAIangAIIcnigMIUspnOpangorfl4cmgBcmgMangMII) was isolated using 
30 standard protocols. The construct was verified with restriction digests and sequence analysis. 
The plasmid was used to transform mutant strains of S. erythraea with standard procedures. 

Biotransformation using S. erythraea Q42/1 pSGl 448/27/9 l/4spnO P 5/24/193/6angMn 
(pSG144angAIangAIIcmgMinspnOpangorfl4angBcmgMangMU) 
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Biotransformation experiments feeding tylactone are carried out as described in General 
Methods and the cultures are analysed. Angolosaminyl tylactone is detected. 

Isolation of plasmid pSG 1448/27/96/4 (pSG144angAIangAUangorf4) 
Plasmid pSG1448/27/9 (pSG 1 44angAIangAU) was digested with Jftal and treated with 
alkaline phosphatase using standard protocols. The vector fragment was used for ligations 
with Xbal treated plasmid pSGU\26/4 no9 followed by transformations of E. coli DH10B 
using standard protocols. Plasmid pSG\448/27/96/4 (pSG 1 44angAIangAUangorf4) was 
isolated using standard procedures and the construct was confirmed by restriction digests and 
sequence analysis. 

Isolation of plasmid pSG1448/27/96/4p5/2 {pSG144angAIangAIIangorf4pangorfl4) 
Plasmid pSGLit3 5/2 (isolated from E. coli ET12567) was digested v/iftiXbal and the insert 
fragment was isolated and ligated with the Xbal digested vector fragment of 
pSG\448/27/96/4 (pSGl44angAIangAIIangorf4). The ligation was used to transform E. coli 
DH10B and plasmid pSGl448/27/96/4p5/2 (pSGl44angAIangAIIangoTf4pangorfI4) was 
isolated using standard protocols. The construct was verified with restriction digests and 
sequence analysis. 

Isolation of plasmid pSG1448/27/96/4p5/2 1/4 
(pSG144angAIangAIIangorf4pangOTfl4angMIII) 

Plasmid pSGLit27/4 (isolated from E. coli ET12567) was digested with Xbal and the 1 .4 kb 
fragment was isolated and ligated with the Xbal digested vector fragment of 
pSG\448/27/96/4p5/2 {pSG\44angAIangAIIangorf4pangorfl4). The ligation was used to 
transform E. coli DH10B and plasmid pSG144S/2 7/96/4p5/2 1/4 

(pSGl44angAIangAIIangorf4pangorfl4angMIII) was isolated using standard protocols. The 
construct was verified with restriction digests and sequence analysis. 

Isolation of plasmid pSG1448/27/96/4p5/2 1/44/19 
(pSG144angAIangAIIango?f4pangorfl4angMIIIangB) 

Plasmid pSGUt24/19 (isolated from E. coli ET12567) was digested with Xbal and the 1.4 kb 
fragment was isolated and ligated with the Xbal digested vector fragment of 
pSG\448/27/96/4p5/21/4 (pSG 1 44angAIangAIIangorf4pangorfl4angMin). The ligation was 
used to transform E. coli DH10B and plasmid pSG\448/27/96/4p5/2 1/44/19 
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(pSG\44angAIcmgAIIangorf4pangorfl4cmgMniangB) was isolated using standard protocols. 
The construct was verified with restriction digests and sequence analysis. 

Isolation of plasmid P SG1448/27/96/4p5/21/44/193/6angMII 
(pSG144coigAIangAIIcmgorf4pcmgorfl4mgMnangBangMangMI) 

Plasmid pSGU48/27/91/4spnO P 5/24/193/6angMn was digested with BgUl and the about 2.2 
kb fragment was isolated and used to ligate with the BgM treated vector fragment of 
pSG\448/27/96/4p5/21/44/19. The ligation was used to transform E. coli DH10B using 
standard procedures and plasmid pSG\448/27/96/4p5/21/44/193/6angMn 
{pSG\AAa n gAIcmgAIImgorf4pangorfl4cmgMUangBangMIarigMII) was isolated. The 
construct was verified using restriction digests and sequence analysis. The plasmid was used 
to transform mutant strains of 5. erythraea with standard protocols. 

Byconversion oftylactone with S. erythraea Q42/1 P SG1448/27/96/4p5/21/44/193/6angMn 
i (pSG144angMangAIIangorf4pangorfl4angMniangBangMIangMII) 

Biotransformation experiments feeding tylactone are carried out as described in General 
Methods and the cultures are analysed. Angolosaminyl tylactone is detected. 

Example 7: Cloning of eryK into the gene cassette pSG144 
0 Isolation of plasmid pUC19eryK 
To amplify eryK primers eiyKl 5'- 

GG TCTAGAC TACGCCGACTGCCTCGGCGAGGAGCCC-3' and eryK2: 5'- 
GGCATATGTTCGCCGACGTGG AAACGACCTGCTGCG-5 ' were used and the PCR 
product was cloned as described for p\3C\9eryCVI. Plasmid p\JCl9<tryK was isolated. 



25 



30 



Isolation of plasmid pLSBl 11 (pCJK24eryK) 

Plasmid p\5C\9eryK was digested with NdeVXbal and the insert band was ligated with 
NdeVXbal digested pCJR24. Plasmid pLSBl 1 1 ( P CJR24erW0 was isolated and the construct 
was verified with restriction digests. 



Isolation of plasmid pLSB115 

Plasmid pLSBl 1 1 (pCJKZAeryK) was digested with NdeVXbal and the insert fragment was 
isolated and ligated with the NdeVXbal digested vector fragment of plasmid P SGLit2 and 
plasmid pLSBl 15 was isolated using standard protocols. The plasmid was verified using 
3 5 restriction digestion and DNA sequence analysis. 
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Isolation of plasmid pSGl 448/27/95/2 l/4eryK 

Plasmid pLSBl 15 from E. coli ET12567 was digested with Xbal and the insert fragment was 
isolated and ligated with the Xbal treated vector fragment of pSGl448/2 7/95/2 1/4 
(pSGl44angAIangAIIangorfl4angMIII), The ligation was used to transform E. coli DH10B 
with standard procedures and plasmid pSGl 448/2 7/95/2 l/4eryK 

(pSG\44cmgAIangAJIangorfl4angAdIIIeryK) is isolated. The construct is confirmed with 
restriction digests. 

Isolation of plasmid pSG 1448/27/9 5/21 /4eryK4/ 19 

Plasmid pSGLit24/19 from E. coli ET 12567 is digested with Xbal and the insert fragment is 
isolated and ligated with the Xbal treated vector fragment of plasmid 
pSGl448/27/95/21/4eryK. The ligation is used to transform E. coli DH10B with standard 
procedures and plasmid pSG\448/27/95/21/4eryK4/19 

(pSG 1 44angAIangAIIarigorfl 4angMIUeryKangE) is isolated. The construct is confirmed 
with restriction digests. 

Isolation of plasmid pSG1448/27/95/21/4eryK4/193/6eryCIII 

Plasmid pSGl 446/2 7/P5/2 1/44/1 93/6eryCIII is digested with BglR and the about 2.1 kb 
fragment is isolated and ligated with the BglR treated vector fragment of 
pSG!448/27/95/21/4eryK4/19. Plasmid pSGl 448/2 7/95/2 l/4eryK4/193/6eryCIU is isolated 
using standard procedures and the construct is confirmed using restriction digests. The 
plasmid is used to transform mutant strains of S. erythraea with standard methods. 

Bioconversion of 3-O-mycarosyl erythronolide B to 5-0-dedesosaminyl-5-0-mycaminosyl 
erythromycin A 

The & erythraea strain Q42/lpSGl448/27/95/2 l/4eryK4/193/6eryCHI is grown and 
byconversions with fed 3-O-mycarosyl erythronolide B are performed as described in the 
General Methods. The cultures are analysed and a compound with m/z 750 is detected 
consistent with the presence of 5-O-dedesosaminyl-5-0-mycaminosyl erythromycin A. 

Example 8: Production of 13-desethyl-13-methyl-5-0-mycaminosyl erythromycins A 
and B; 13-desethyl-13-isopropyl-5-0-inycaminosyI erythromycin A and B; 13-desethyl- 
13-secbutyI-5-0-mycaminosyl erythromycin A and B 
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Production of 13-desethyl-13-methyl-3-0-mycarosyl erythronolide B, 13-desethyl-13- 
isopropyl-3-O-mycarosyl erythronolide B and 13-desethyl-13-secbutyl-3-0~mycarosyl 
erythronolide B 

Plasmid pLS025, (WO 03/033699) a pCJR24-based plasmid containing the DEBS1, DEBS2 
and DEBS3 genes, in which the loading module of DEBS1 has been replaced by the loading 
module of the avermectin biosynthetic cluster, was used to transform £ erythraea 
JC2AeryCm (isolated using techniques and plasmids described previously (Rowe et al, 1998; 
Gaisser et aL, 2000)) using standard techniques. The transformant JC2AeryCIHpLS025 was 
isolated and cultures were grown using standard protocols. Cultures of £ erythraea 
JC2AeryCmpLS025 are extracted using methods described in the General Methods section 
and the presence of 3-0~mycarosyl erythronolide B, 13-desethyl-13-methyl-3-0-mycarosyl 
erythronolide B, 13-desethyl-13-isopropyl-3-0-mycarosyl erythronolide B and 13-desethyi- 
13-secbutyl-3-0-mycarosyl erythronolide B in the crude extract is verified by LCMS analysis. 

1 5 Production of 13'desethyU13-methyU5^0'dedesosminyU5-0^mycaminosyl erythromycin A 

and B, 13-desethyl'13-isopropyl-5-0-dedesosaminyl-5-0-mycaminosyl erythromycin A andB, 
13^esethyU13-secbutyl-5-0-dedesosmrnyl-5-0-mycaminosyl erythromycin A and B 
Cultures of £ erythraea JC2AeiyCIIIpLS025 are extracted using methods described in the 
General Methods section and the crude extracts are dissolved in 5 ml of methanol and 

20 subsequently fed to culture supernatants of the £ erythraea strain 

SGQ2pSGl448/27/95/2 1/44/1 93/6eryCUI using standard techniques. The bioconversion of 
13^esethyl-13-methyl-3-0-mycarosyl erythronolide B, 13-desethyl-13-isopropyl-3-0- 
mycarosyl erythronolide B and 13-desethyl-13-secbutyl-3-0-mycarosyl erythronolide B to 
13-desethyH3-methyl-5-0-dedesosaminyl-5-0-mycaminosyi erythromycin A and 13- 

25 desethyH3-methyl-5-0-dedesosaminyl-5-0-mycaminosyl erythromycin B; 13-desethyl-13- 
isopropyl-S-O-dedesosaminyl-S-O-mycaminosyl erythromycin A and 13-desethyl-13- 
isopropyl-S-O-dedesosaminyl-5-O-mycaminosyl erythromycin B;13-desethyl-13-secbutyl-5- 
O-dedesosaminyl-5-O-mycaminosyl erythromycin A and 13-desethyl-13-secbutyl-5-<3- 
dedesosaminyl-5-O-mycaminosyl erythromycin B is verified by LCMS analysis. 

30 

Example 9 ; 13-desethyl-13-methyl-5»0-dedesesaminyl-5-0^ycaminosyl erythromycin 
A and 13-desethyl-13-methyl-5-0-dedesosaminyl-S.O-mycaminosyl erythromycin B 

Production of l3-desethyl-13-methyl-3-0-mycarosyl erythronolide B 
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Plasmid pIB023 (Patent application no 0125043.0), a pCJR24-based plasmid containing the 
DEBS1, DEBS2 and DEBS3, was used to transform S. erythraea JC2AeryCm using standard 
techniques. The transfonnant JC2AeryCIIIpIB023 was isolated and cultures were grown 
using standard protocols, extracted and the crude extract was assayed using methods 
5 described in the General Methods section. The production of 3-0-mycarosyl erythronolide B, 
and 13-desethyl-13-methyl-3-0-mycarosyl erythronolide B is verified by LCMS analysis. 

Production of 13-desethyl-13-methyl-5-0-dedesosaminyl-5-0-mycaminosyl erythromycin A, 
13-desethyl-13-methyL5-0~dedesoscmiinyl-5-0-mycaminosyl erythromycin B 
10 Cultures of & erythraea JC2AeryCIIIpIB023 are extracted using methods described in the 
General Methods section and the crude extracts are dissolved in 5 ml of methanol and 
subsequently fed to culture supernatants of S. erythraea 

SGQ2pSG144S/27/95/2 1/44/1 93/6eryCIII using standard techniques. The byconversion of 
13-desethyl-13-methyl-3-0-mycarosyl erythronolide B to 13-desethyl-13-methyl-5-0- 
15 dedesosaminyl-5-O-mycaminosyl erythromycin A and 13-desethyM3-methyl-5-0- 
dedesosaminyl-5-O-mycaminosyl erythromycin B are verified by LCMS analysis. 

Example 10: Production of 5-0-dedesosaminyI-5-<?-mycaminosyl azithromycin 

20 Azithromycin aglycones were prepared using methods described in EP1024145A2 (Pfizer 

Products Inc. Groton, Connecticut). The S. erythraea strain SGT2pSG142 was isolated using 
techniques and plasmid constructs described earlier (Gaisser et al., 2000). Feeding 
experiments are carried out using methods described previously (Gaisser et cd. 9 2000) with the 
S. erythraea mutant SGT2pSG142 thus converting azithromycin aglycone to 3-O-mycarosyl 

25 azithronolide. Biotransformation experiments are carried out using S. erythraea 

SGQ2pSG1445/27/P5/2 1/44/1 93/6eryCIII and crude extracts containing 3-Omycarosyl 
azithronolide are added using standard microbiological techniques. The byconversion of 3-O- 
mycarosyl azithronolide to 5-0-dedesosaminyl-5-0-mycaminosyI azithromycin is verified by 
LCMS analysis. 

30 

Example 11: Production of S-O-dedesosaminyl-5-O-mycaminosyl erythromycin C 



Isolation of the S. erythraea mutant SGP1 (SGQ2AeryG) 

To create a chromosomal deletion in eryG 9 construct pSGAG3 was isolated as follows: 
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Fragment 1 was amplified using primers BIOSG53 5'- 
G GAATTCG GCCAGGACGCGTGGCTGGTCACCGGCT -3' and 

BIOSG54 5 ' <3GTCTAGAAAG AGCGTG AGC AGGCTCTTCT AC AGCC AGGTC A -3 ' and 
genomic DNA of S. erythraea was used as template. Fragment 2 was amplified using primers 
BIOSG55 5'-GG£ATGCAGGAAGGAGAGAACCACGATGACCACCGACG-3' and 
BIOSG56 5^GTCT^GACACCAGCCGTATCCTTTCTCGGTTCCTCTTGTG.3' and 
genomic DNA of S. erythraea was used as template. Both DNA fragments were cloned into 
Smal cut pUC19 using standard techniques, plasmids pUCPCRl and pUCPCR2 were isolated 
and the sequence of the amplified fragments was verified. Plasmid pUCPCRl was digested 
using EcoJWXbal and the insert band DNA was isolated and cloned into EcoMJXbal digested 
pUC19. Plasmid pSGAGl is isolated using standard methods and digested with SphVXbal 
followed by a ligation with the SphVXbal digested insert fragment of pUCPCR2. Plasmid 
pSGAG2 is isolated using standard procedures, digested with SphVHindm and ligated with 
the SphVHindm fragment of P CJR24 (Rowe et al., 1998) containing the gene encoding for 
1 5 thiostrepton resistance. Plasmid pSGAG3 is isolated and used to delete eryG in the genome of 
S. erythraea strain SGQ2 using methods described previously (Gaisser et al., 1997; Gaisser et 
al., 1998) and the S. erythraea mutant SGP1 (SGQ2AeryG) is created. 

Production 0 f5-0-dedesosaminyl-5-0-mycaminosyl erythromycin C 
20 The S. erythraea strain SGP1 (S. erythraea SGQ2AeryG) is isolated using standard 
techniques and consequently used to transform the cassette construct 
pSG144S/27/95/2 1/44/1 93/6eryCIU as formerly described. The S. erythraea strain 
SGP\j>SGl448/27/95/21/44/193/6eryCIII is isolated and used for biotransformation as 
described in Example 2 and assays are carried out as described above to verify the conversion 
25 of 3-0-mycarosyl-erythronolide B to 5-0-dedesosaminyl-5-0-mycaminosyl erythromycin C 
by LCMS analysis. 
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Figure 7 

1 GGCATGCCTT CGGGGTGTGC GGCGGCGCCT CAGAGCGTGG CCAGTACCTC 
51 GTGCAGGGCC GCGATCACCT TGTCCTGTAC GTCGGGCGCG AGCCCCGGGT 
101 ACATCGGCAG CGAGAAGATC TCGTCCGCCA GCCGCTCCGT CACCGGCAGC 
151 GAGCCCTTGG CGTACCCCAG GTGCGCGAAG CCCGTCATGG TGTGCACGGG 
201 CCACGGGTAA CTGATGTTGA GCGAGATCCC GTACGACTTG AGCGCCTCGA 
251 TGATGTCGTC CCGGCGCGGG TGGCGGACGA CGTACACGTA ATACACGTGG 
301 TCGTTGCCCT CGGTGACGGA CGGCAGCACC AGGCCGCCGG GGCCCGTCAG 
351 GTTCGCGAGT CCTTCGGCGT AACGCCGGGC GACCGCGCGC CGGCCCTCGA 
401 TGTAGCGGTC GAGGCGGGTG AGCTTGCGGC GCAGGATCTC CGCCTGCACC 
451 TCGTCGAGCC GGCTGTTGTG GCCGGGCGTC TGCACGACGT AGTACACGTC 
501 CTCCATGCCG TAGTAGCGCA GCCGGCGCAG CGCACGGTCG ACGTCCGCGT 
551 CGTCGGTCAG CACGGCCCCG CCGTCGCCGT ACGCACCGAG GACCTTCGTC 
601 GGGTAGAACG AGAAGGCGGC GGCGTCGCCC AGCGTGCCGG CCAGCTCGCC 
651 GTGGTGGCGG GCACCGTGCG CCTGGGCGCA GTCCTCCAGC ACCACCAGGC 
701 CGTGCTGCTC GGCCAGGGCG CGCAAGGGCG CCATGTCGAC GCACTGCCCG 
751 TACAGGTGCA CCGGCAGCAG GGCCTTCGTG CGCGGGGTGA TGACGTCCGC 
801 GACCTGGTCG GTGTCCATGA GGTGGTCCTC GGCGCGGACG TCGACGAAGA 
851 CGGGCGTGGC ACCGGTGCCG TCGATGGCCA CCACCGTCGG CGCGGCCGTG 
901 TTGGAGACGG TGACGACCTC GTCCCCCGGG CCCACCCCGA GCGCCTGCAG 
951 ACCCAGCTTG ACGGCGTTGG TGCCGTTGTC GACACCGCCG CAGTGGCGCA 
1001 GGCCGTGGTA GTCCGCGAAC TCCTTCTCGA ACCCGTCCAC GCTGGGGCCG 
1051 AGGACCAACT GCCCGGAGGC GAAGACGGIC TCGACGGCGT CGAGGAGGTC 
1101 CGCGCGTTCG TTCTGGTATT CCGCCAGGTA GTCCCAGACG TAGGTAGTCA 
1151 CGGAGAGCTC AACCTCCAGA GTGTTTCGAT GGGGTGGTGG GAAGCCGGTG 
1201 CGCGCGGACC AGGTCGTGCC AGCAGTCGCG GACCGACTCC CGCAGCGAAC 
1251 GGCGCGGTGC CCAGCCCAGC AGGGCGCGCG CCGCGCCGGT GTCGACCCGC 
1301 AGCCAGTCCT CCCGGTGCCC GGGAGCCCGG CCCGGAGCCG GGCGCTCCAC 
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Figure 8 

11301 CGTCAGTACA GCGTGTGGGC ACACGCCACC AGGGTGCGCA GCTCGATGTT 
11351 GAGGTAGTTG CCGTGCGCCA GCAGCCCGGT GAGCTGACCG AGCGACAGCC 
11401 AGGCGAAGTC GTCCGGTGCG TCCTCCGGGA AGTCGTGCGG GACCTCCACG 
11451 ATCACGTAGC GGTTCTGGGC GTGGAAGAAG CGCCCGCCCT CCTCGGACTG 
11501 GACGGCGTCG TAGCGCACGT CCTGAGGCGG CGCGGACAGC ACGTCCTCCA 
11551 GGTACGGCGG GCCGGGCAGC CCCCGCGGAC CGGTGTGCTC CTGTGGCCGG 
11601 CACTGGACCG TGGGGGCCAG CTCGGCGACG TTCAGGTGCC CGACGTCCAC 
11651 CCGTGCCCGC ACGAGCGCGT GCAGCACGCC GTCGACGGAC TTGACCAGCA 
11701 GCGCCATCAQ ACCCGGCAGC CGCGGCTCGA TGAGCGGCTG CGTCCAGGAG 
11751 GTGACCTCCC GGCTGCTGGC GCTGACCTCG GCGGCCATGA CCCGGAAGTG 
11801 CCGCCCGCTC TCGTGGGCGA TCTCGTGCGG CGTGCGGTAC CAGCCGTCCG 
11851 CCGTCACCGT ATCGAGCGGC ACCCGGTTCT GCACCAGCTC CCGCAGGGCG 
11901 CGCACACCCG TGAACCACGT CAGGACCTCG GCCGTCGTGT GCCGCGCCGC 
11951 ACCCGGCGAG CCGAAGAAGG AGCGCAGCAC GGGGGACGGG GCGGACGCGT 
12001 CGGCGTCCGC CGTGGGCAGG CAGGCGAGGA TGGACCGGGC GTCCATGTTG 
12051 ACCACGTTGT CCAGCATCAG CAGCCGGCGG AGCTGCCCCA GCGTCAGCCA 
12101 GCGGAAGTCC TCCCCGATGT CGAGGTCGTC GTCCGCCGCC AACTCGACGA 
12151 TCATGTTCCG GTTGCGTTTG GCCAGGAACC AGTCCGCCTG TTCGGACTGG 
12201 ATCGAGTCGA CCAGGACACG CGCCCGTCGC GGCCCCATGA ACAGGTCCAG 
12251 ATAGCGGATG TCGCGCCCCC GGTGCACCCC GGTGAAGTTG CTCCGGGTGG 
12301 CCTGCACGGT CGGCGACACC TGAAGAACGT TGACGTTCCC GGGCTCCATC 
12351 TTGGCCTGCA TCAGGAAGTG CAGCACCCCG TCGATCTCCC GCGCCACGAT 
12401 CCCGAGCAGC CCCACCTCCG GCTGCACGAT GATGGGCTGC GTCCAGCCCC 
124 51 GCTCGGGCAG CCGGTCCGTA CGGACGTGCA GCCCCTCCAC GGAGAAGAAA 
12501 CGGCCCGACG CGTGGTGCAG GTTTCCCGTA CCCGGGTGGA AGCTCCAGCC 
12551 GCGCAGCTCC GCGAAGGGAA CGCGGGACAC GTCGAAGCGC CCCGCCCGCA 
12601 GGCGTTCGGC CAGCCAGCCG GAGATGCCGT CGAACGGCGT GACCGCACTG 
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12651 TCCGCGGTGC GTGCCGACAC CAGCACCCGC CGCGCCGTGT CCACCGGGTC 
12701 ACCGGGCCGG ACCGCGTCCG CACGGCGCCG CGCGGCGCCG TGCGGGGCGG 
12751 GGGCGGATCG CGGCGGTACG GGTTCGCGGG CGGTGTCCGC GGCGGTGCGC 
12801 GGCGGGACGG GGCCGGTGCT CGTGTCCGCG GCGGTACGCG GTGGGACGGT 
12851 CCCGGTGGCC GTGTCCGCGG TGGCCGTGCC GGCGAGGGCG TCGCCGATGG 
12901 TCCGGCACAC CTCGTCCATC CGGTCGTTCA GATAGAAGTG ACCGCCGGCG 
12951 AAGGTGTGCA GGGCGAAGGG GCCCGTGGTC AGCTCCCGCC AGGCCCTCGC 
13001 CTCCTCCAGC GGGACATCGG GATCACGGTC ACCGGTGAGC ACCGTGACCG 
13051 GACAGTCCAG CGCACCGCCG GGCACATACG CGTACGTGCC CGCCGCCCGG 
13101 TAGTCGTTGC GGATCGCCGG CAGGGCCAGC CGCAGCAGCT CCTCGTCCTG 
13151 GAGGACGGCG TCCTCGGTGC CCTGAAGCGT GGCGATCTCC GCGATCAGCG 
13201 CGTCGTCGTC GAGGAGGTGG GCGACGTCCC GCCGGCGCAC CGTCGGCGCA 
13251 CGGCGGCCCG ACACCAGCAG ATGGACGGGG GAGGCCTGCC CGGAACCGCG 
13301 CAGCCGGCGC GCGACCTCGA ACGCCACCGT GGCACCCATG CTGTGCCCGA 
13351 ACAGCGCGAG CGGACGGTCG GCCCAGCGCA GGATCTCCGG CACCACCTGG 
13401 TCCACCAGGC CCGATATGGA CGGGATGAAC GGCTCGTGCC GGCGGTCCTG 
13451 GCGGCCCGGG TACTGCACCG CCAGCGCCTC CACGGTCTCG TCCAGTCCGC 
13501 GTGCCAGGGC GGCGAAGGAG GTCGCGGCGC CACCGGCGTG CGGGAAGCAG 
13551 ACCAGACGCA GTTCCGGATC CCGCACCGGG CGGTAACGGC GGACCCACAG 
13601 ACCCTCGTCC GGGTGTCCGG CCGGCGACGG GGCTCCCGGA ACGGGTGGTG 
13651 CGGAAGGGGT GCTCACGGCG GATCCAGCTC CTCGCGGTCG GGGGGACCGC 
13701 TGTCGGGGAC GGCACGTCGG GTGCGGACGT CGGGTACGGG CGTCGGGGCG 
13751 TGACGGGGAG GGACGGGGCG GTCGGTCAGT CGGTGCGCCG GGCCTCCTGC 
13801 GCGGCCTTCT TCAGCGGTTC CCACCACGCG CGGTTCTCCG CGTACCAGCG 
13851 CACCGTGTCC GCCAGGCCCG TCGTGAAGTC CGTACGCGGG GCATAGCCCA 
13901 GCTCGCCCGT GATCTTGCCG ATGTCCAGCG CGTACCGCAG GTCGTGCCCC 
13951 GGCCGGTCGG CGACGTGGCG CACCGACGAG GCGTCGGCAC CGCACAGCCC 
14001 GAGCAGCCGC TTCGTCAGCT CCCGGTTGGT CAGCTCCGTC CCGCCACCGA 
14051 TGTGGTAGAC CTCGCCCGGG CGCCCGCGGG TCGCCACCAG GCTGATCCCG 
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14101 CGGCAGTGGT CGTCCACGTG CAGCCAGTCC CGGCTGTTGC CGCCGTCGCT 
14151 GTACAGCGGC ACCGTCAGAC CGTCCAACAG GTTCGTGGCG AAGAGCGGGA 
14201 CGACCTTCTC GGGGTGCTGG TACGGGCCGT AGTTGTTGGA GCACCGGGTG 
14251 ACGACGACCG GCAGGCCGTA CGTCCGGTGG TAGGCCAGCG CCAGGAGGTC 
14301 CGACGCCGCC TTCGAGGCGG CGTACGGGGA GTTCGGCGCC AGCGGCTGCT 
14351 CCTCGCGCCA CGACCCCTCG GCGATCGAGC CGTACACCTC GTCCGTGGAG 
14 401 ACGTGGACGA ACCGGCCGGC CCCCGCCTCC ACCGCGGCCT GCAAGAGGAC 
14451 TTGCGTCCCC CGTACGTTCG TCTCGACGAA CGCCGACGCG TCGGCGATGG 
14501 AGCGGTCCAC GTGCGACTCC GCCGCGAAGT GGACCACGAC GTCCGCCCCC 
14551 CGCACGACCC GGGACATCAC CTCCGCGTCC CGGATGTCGG CGTGCACGAA 
14 601 CTCCAGCGAC GGATGGTCCG CGACCGGGTC CAGGTTGGCG AGGTTCCCGG 
14651 CATAGGTCAG CTTGTCGACC ACCACCGTCC GCGCCCCGGC CAGGTCCGGA 
14701 TACGCCCCGG CCAGCAGTTG TCTGACGAAG TGCGAGCCGA TGAAGCCCGC 
14751 ACCTCCGGTG ACCAGCAGCC GCATGGGAGC ACAGACCTTT CTTCCAGGGA 
14801 CGGGAAACGG GGAGGCGGAC GGGGACGGAG GCGAGGGCGG TGGCTATGCG 
14851 GCCGGTCCGG ACATGAGGGT CTCCGCCACG TCCATCAAGT ACCGGCCGTA 
14 901 GCTGGAGCTC TCGAGTTCAC GGCCGAGCTC GTGGCACTGC CGCGCGCTGA 
14951 TGTACCCCAT CCGCAGGGCG ATCTCCTCGA CGCAGGAGAT CCGCACGCCC 
15001 TGCCGCTGCT CCAGGAGCTG GACGTACTGC CCCGCTTGCA GCAGCGAGCT 
15051 GTGCGTGCCC ATGTCCAGCC AGGCGAACCC GCGCCCCAGT TCCGTCATAC 
15101 GGGCGCGGCC CTGCTCCAGG TACACCTTGT TGACGTCGGT GATCTCCAGC 
15151 TCGCCCCGCG GCGACGGTGT CAGCCGCCGG GCGATGTCCA CCACGCCGTT 
15201 GTCGTAGAAG TACAGCCCCG TCACCGCGAG ATGGGAGCGG GGCTTCTCCG 
15251 GCTTCTCCTC CAGGGACACC AGCCGGCCTT CCGCGTCGAC CTCGCCGACG 
15301 CCGTAGCGCC GGGGGTCCTT CACCGGGTAG CCGAACAGCT CGCAGCCGTC 
15351 CAGCCGCGCC GCGGTGGAGG CCAGCACGGA GGAGAACCCC GGACCGTGGA 
15401 AGACGTTGTC CCCCAGGATG AGGGCGACCG GGTCGTCCCC GATGTGCTCC 
15451 TCGCCGATGA GGAACGCCTC GGCGATGCCC CGGGGCTCCT CCTGCTCGGC 
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15501 GTAGCCGACA CTGATCCCGA TGCGGCTGCC GTCGCCCAGC AGCGAACGGA 

15551 ACATCTCCAA GTGCGTCTTC GACGTGATGA TCTGGATGTC CCGGATCCCC 

15601 GCCAGCATGA GCACCGACAG CGGGTAGTAG ATCATGGGCT TGTCGTAGAC 

15651 CGGCAGCAAC TGCTTGGACA GTGCCCCGGT CAGGGGGCGC AGGCGCGTGC 

15701 CGCTGCCGCC CGCCAGGATG ATGCCCTTCA TGGGCCGCCG GTCCGCCGTC 

15751 GTCTTCGTCA T 
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Figure 9 

59800 G 

59801 TGAGCCCCGC ACCCGCCACC GAGGACCCGG CCGCCGCCGG GCGCCGCCTG 
59851 CAACTGACCC GCGCAGCCCA GTGGTTCGCG GGAACCCAGG ACGACCCGTA 
59901 CGCGCTCGTC CTGCGCGCCG AGGCCACCGA CCCGGCCCCG TACGAGGAGC 
59951 GGATCCGGGC CCACGGGCCG CTCTTCCGCA GCGACCTGCT CGACACCTGG 
60001 GTCACGGCGA GCAGGGCCGT CGCCGACGAA GTGATCACCT CACCCGCCTT 
60051 CGACGGGCTC ACGGCCGACG GGCGGCGCCC CGGCGCGCGG GAACTGCCGC 
60101 TGTCCGGCAC CGCGCTCGAC GCGGACCGCG CCACATGCGC ACGG TTCGGG 
60151 GCCCTCACCG CCTGGGGCGG GCCGCTGCTG CCGGCGCCGC ACGAGCGGGC 
60201 GCTGCGCGAG TCCGCCGAAC GGCGGGCCCA CACACTCCTC GACGGGGCGG 
60251 AGGCCGCCCT GGCCGCCGAC GGCACCGTCG ACCTCGTCGA CGCGTACGCC 
60301 CGCAGGCTCC CCGCGCT GGT CCTCCGCGAA CAGCTCGGCG TGCCGGAGGA 
60351 GGCGGCGACC GCCTTCGAGG ACGCGCTGGC CGGCTGCCGC CGCACCCTGG 
60401 ACGGCGCCCT GTGCCCGCAA CTCCTCCCGG ACGCCGTGGC GGGGGTGCGC 
•60451 GCGGAAGCCG CGCTGACCGC CGTGCTGGCC TCCGCCCTGC GCGGGACTCC 
60501 GGCCGGCCGG GCCCCCGACG CCGTCGCCGC CGCCCGCACC CTGGCCGTCG 
60551 CGGCCGCCGA GCCCGCAGCC ACCCTCGTCG GCAACGCCGT ACAGGAGCTG 
60601 CTGGCGCGTC CCGCGCAGTG GGCGGAGCTC GTACGCGACC CGCGCCTCGC 
60651 GGCCGCCGCG GTGACCGAAA CGCTGCGTGT CGCCCCGCCC GTCCGCCTGG 
60701 AGCGGCGGGT CGCCCGCGAG GACACGGACA TCGCCGGGCA GCGCCTCCCC 
60751 GCCGGGGGGA GCGTCGTGAT CCTCGTCGCC GCCGTCAACC GCGCGCCCGT 
60801 ATCCGCGGGA AGCGACGCCT CCACCACCGT CCCGCACGCC GGCGGCCGGC 
60851 CCCGTACCTC CGCCCCCTCC GTCCCCTCAG CCCCCTTCGA CCTCACACGG 
60901 CCCGTGGCCG CGCCCGGGCC GTTCGGGCTC CCCGGCGACC TGCACTTCCG 
60951 CCTCGGCGGG CCCCTGGTCG GAACGGTCGC CGAAGCCGCG CTCGGTGCGC 
61001 TGGCCGCACG GCTCCCCGGT CTGCGCGCCG CCGGGCCGGC CGTGCGGCGC 
61051 CGCCGCTCAC CGGTGCTGCA CGGACACGCC CGCCTCCCCG TCGCCGTCGC 



61101 CCGGACGGCC CGTGACCTGC CCGCCACCGC ACCGCGGAAC TGAGGAGGGA 
61151 GTGCCCCGAT GCGTATCCTG CTGACGTCGT TCGCGCACAA CACGCACTAC 
61201 TACAACCTGG TCCCCCTCGG CTGGGCGCTG CGCGCCGCCG GGCACGACGT 
61251 ACGGGTCGCC AGCCAGCCCT CGCTGACCGG CACCATCACC GGCTCCGGGC- 
61301 TGACCGCCGT CCCCGTGGGC GACGACACGG CCATCGTCGA GCTGATCACC 
61351 GAGATCGGCG ACGACCTCGT CCTCTACCAG CAGGGCATGG ACTTCGTGGA 
61401 CACCCGCGAC GAGCCGCTGT CCTGGGAACA CGCCCTCGGA CAGCAGACGA 
61451 TCATGTCGGC CATGTGCTTC TCGCCGCTGA ACGGCGACAG CACCATCGAC 
61501 GACATGGTGG CGCTGGCCCG TTCCTGGAAA CCGGACCTCG TCCTGTGGGA 
61551 GCCCTTCACC TACGCGGGAC CCGTCGCCGC GCACGCCTGC GGCGCCGCCC 
61601 ACGCCCGGCT GCTGTGGGGT CCCGACGTGG TCCTCAACGC ACGGCGGCAG 
61651 TTCACCCGGC TGCTCGCCGA GCGCCCCGTC GAACAGCGCG AGGACCCGGT 
61701 CGGCGAATGG CTCACGTGGA CGCTGGAGCG CCACGGCCTC GCCGCCGACG 
61751 CGGACACGAT CGAGGAACTG TTCGCCGGGC AGTGGACGAT CGACCCCAGC 
61801 GCCGGGAGCC TGCGGCTGCC GGTCGACGGC GAGGTCGTGC CCATGCGCTT 
61851 CGTGCCGTAC AACGGCGCCT CGGTCGTCCC CGCCTGGCTC TCCGAGCCGC 
61901 CTGCCCGGCC CCGGGTCTGC GTCACCCTCG GCGTCTCCAC CCGGGAGACC 
61951 TACGGCACGG ACGGCGTCCC GTTCCACGAA CTGCTGGCCG GACTGGCCGA 
62001 CGTGGACGCC GAGATCGTCG CCACCCTCGA CGCGGGGCAG CTCCCGGACG 
62051 CCGCCGGTCT GCCCGGCAAT GTGCGCGTCG TCGACTTCGT GCCGCTGGAC 
62101 GCCCTGCTGC CGAGCTGCGC CGCGATCGTC CACCACGGAG GCGCGGGAAC 
62151 CTGTTTCACG GCCACCGTGC ACGGCGTCCC GCAGATCGTC GTGGCCTCCC 
62201 TCTGGGACGC GCCGCTGAAG GCGCACCAAC TCGCCGAGGC GGGCGCCGGG 
62251 ATCGCCCTGG ACCCCGGGGA ACTGGGCGTG GACACCCTGC GCGGCGCCGT 
62301 CGTGCGGGTG CTGGAGAGCC GCGAGATGGC CGTGGCGGCG CGTCGCCTCG 
62351 CCGACGAGAT GCTCGCCGCC CCCACCCCGG CCGCGCTCGT CCCCCGCCTC 
62401 GAACGCCTCA CCGCCGCGCA CCGCCGCGCC TGATCCCGCC AAGGAGCCCC 
62451 CATGAACCTC GAATACAGCG GCGACATCGC CCGGTTGTAC GACCTGGTCC 
62501 ACCAGGGAAA GGGCAAGGAC TACCGGGCGG AGGCCGAGGA GCTGGCCGCG 
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CGGGGCCCGC TCCCTCCTCG ACGTGGCCTG 
GGCACCTCGG CGACCTCTTC GAGGAGGTGG 
GACATGCTGG CCATCGCGCA GCGGCGCAAC 
GGGGGACATG CGGGACTTCG CCCTCGGCCG 
GCATGTTCAG TTCCATCGGG CACATGCGCG 
GCGATCGGCC GGTTCGCCGC GCACCTGCCG 
CGATCCCTGG TGGTTCCCGG AGACGTTCAC 
GCCTCGTCGA GGCCGAGGGC CGCACCATCG 
CTCGAGGACG GCGCGACCCG GATCGATGTG 
GGGGGAGGGG GTGCGGCACT TGAAGGAGAC 
GGCGTGCGCA GTACGAGGCG GCCTTCACCG 
TACCTCCCGC ACGCCGCCAC CGACCGCGGA 
CTGA 
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Figure 10 

1 MKGIILAGGS GTRLRPLTGA LSKQLLPVYD KPMIYYPLSV LMLAGIRDIQ 
51 IITSKTHLEM FRSLLGDGSR IGISVGYAEQ EEPRGIAEAF LIGEEHIGDD 
101 PVALILGDNV FHGPGFSSVL ASTAARLDGC ELFGYPVKDP RRYGVGEVDA - 
151 EGRLVSLEEK PEKPRSHLAV TGLYFYDNGV VDIARRLTPS PRGELEITDV 
201 NKVYLEQGRA RMTELGRGFA WLDMGTHSSL LQAGQYVQLL EQRQGVRISC 
251 VEEIALRMGY ISARQCHELG RELESSSYGR YLMDVAETLM SGPAA 
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Figure 11 

1 MRLLVTGGAG FIGSHFVRQL LAGAYPDLAG ARTWVDKLT YAGNLANLDP 
51 VADHPSLEFV HADIRDAEVM SRWRGADW VHFAAESHVD RSI ADAS AFV 
101 ETNVRGTQVL LQAAVEAGAG RFVHVSTDEV YGSIAEGSWR EEQPLAPNSP 
151 YAASKAASDL LALAYHRTYG LPVWTRCSN NYGPYQHPEK WPLFATNLL 
201 DGLTVPLYSD GGNSRDWLHV DDHCRGISLV ATRGRPGEVY HIGGGTELTN 
251 RELTKRLLGL CGADASSVRH VADRPGHDLR YALDIGKITG ELGYAPRTDF 
301 TTGLADTVRW YAENRAWWEP LKKAAQEARR TD 
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Figure 12 

1 VSTPSAPPVP GAPSPAGHPD EGLWVRRYRP VRDPELRLVC FPHAGGAATS 

51 FAALARGLDE TVEALAVQYP GRQDRRHEPF IPSISGLVDQ WPEILRWAD 

101 RPLALFGHSM GATVAFEVAR RLRGSGQASP VHLLVSGRRA PTVRRRDVAH 

151 LLDDDALIAE IATLQGTEDA VLQDEELLRL ALPAIRNDYR AAGTYAYVPG 

201 GALDCPVTVL TGDRDPDVPL EEARAWRELT TGPFALHTFA GGHFYLNDRM 

251 DEVCRTIGDA LAGTATADTA TGTVPPRTAA DTSTGPVPPR TAADTAREPV 

301 PPRSAPAPHG AARRRADAVR PGDPVDTARR VLVSARTADS AVTPFDGISG 

351 WLAERLRAGR FDVSRVPFAE LRGWSFHPGT GNLHHASGRF FSVEGLHVRT 

401 DRLPERGWTQ PIIVQPEVGL LGIVAREIDG VLH FLMQAKM EPGNVNVLQV 

4 51 SPTVQATRSN FTGVHRGRDI RYLDLFMGPR RARVLVDSIQ SEQADWFLAK 

501 RNRNMIVELA ADDDLDIGED FRWLTLGQLR RLLMLDNWN MDARSILACL 

551 PTADADASAP SPVLRSFFGS PGAARHTTAE VLTWFTGVRA LRELVQNRVP 

601 LDTVTADGWY RTPHEIAHES GRHFRVMAAE VSASSREVTS WTQPLIEPRL 

651 PGLMALLVKS VDGVLHALVR ARVDVGHLNV AELAPTVQCR PQEHTGPRGL 

701 PGPPYLEDVL SAPPQDVRYD AVQSEEGGRF FHAQNRYVIV EVPHDFPEDA 

751 PDDFAWLSLG QLTGLLAHGN YLNIELRTLV ACAHTLY 
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Figure 13 

1 MVNDPMPRGS GSGSVWLGG AGYVGRHVCA AFAARGRDW WGRRPPEEP 
51 MPYRCVTLDL AGTDPAALAA ALDAERPDTI VNSVGSIWGR TDEQMWSATA 
101 VPTLRLLEAL ALMSARPRLV HLGSVLEYGP VTPGGSVGAD AVPRPDTAYG 
151 RSKLAASEAV LRGTSGGWVD GWLRVSNVS GPGTPRISLL GQVAERLLAA 
201 AGTGAEAWE LSRLRAHRDY VDVRDVADAV VAAARAPAVP VAVGIGRGEA 
251 VAVRDLVGLF IEASGIPARV VERPAPGRAP GHREDWLRVD TGAARALLGW 
301 APRRSLRESV RDCWHDLVRA HRLPTTPSKH SGG 
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Figure 14 

1 VTTYVWDYLA EYQNERADLL DAVETVFASG QLVLGPSVDG FEKEFADYHG 
51 LRHCGGVDNG TNAVKLGLQA LGVGPGDEW TVSNTAAPTV VAIDGTGATP 
101 VFVDVRAEDH LMDT DQVADV ITPRTKALLP VHLYGQCVDM APLRALAEQH 
151 GLWLEDCAQ AHGARHHGEL AGTLGDAAAF SFYPTKVLGA YGDGGAVLTD 
201 DADVDRALRR LRYYGMEDVY YWQTPGHNS RLDEVQAEIL RRKLTRLDRY 
251 IEGRRAVARR YAEGLANLTG PGGLVLPSVT EGNDHVYYVY WRHPRRDDI 
301 IEALKSYGIS LNISYPWPVH TMTGFAHLGY AKGSLPVTER LADEIFSLPM 
351 YPGLAPDVQD KVIAALHEVL ATL 
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Figure 15 

1 VSPAPATEDP AAAGRRLQLT RAAQWFAGTQ DDPYALVLRA EATDPAPYEE 

51 RIRAHGPLFR SDLLDTWVTA SRAVADEVIT SPAFDGLTAD GRRPGARELP 

101 LSGTALDADR ATCARFGALT AWGGPLLPAP HERALRESAE RRAHTLLDGA 

151 EAALAADGTV DLVDAYARRL PALVLREQLG VPEEAATAFE DALAGCRRTL 

201 DGALCPQLLP DAVAGVRAEA ALTAVLASAL RGTPAGRAPD AVAAARTLAV 

251 AAAE PAATLV GNAVQELLAR PAQWAELVRD PRLAAAAVTE TLRVAPPVRL 

301 ERRVAREDTD IAGQRLPAGG SWILVAAVN RAPVSAGS DA STTVPHAGGR 

351 PRTSAPSVPS APFDLTRPVA APGPFGLPGD LHFRLGGPLV GTVAEAALGA 

401 LAARLPGLRA AGPAVRRRRS PVLHGHARLP VAVARTARDL PATAPRN 
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Figure 16 

1 MRILLTSFAH NTHYYNLVPL GWALRAAGHD VRVASQPSLT GTITGSGLTA 

51 VPVGDDTAIV ELITEIGDDL VLYQQGMDFV DTRDEPLSWE HALGQQTIMS 

101 AMCFSPLNGD STIDDMVALA RSWKPDLVLW EPFTYAGPVA AHACGAAHAR 

151 LLWGPDWLN ARRQFTRLLA ERPVEQREDP VGEWLTWTLE RHGLAADADT 

201 IEELFAGQWT IDPSAGSLRL PVDGEWPMR FVPYNGASW PAWLSEPPAR 

251 PRVCVTLGVS TRETYGTDGV PFHELLAGLA DVDAE I VATL DAGQLPDAAG 

301 LPGNVRWDF VPLDALLPSC AAIVHHGGAG TCFTATVHGV PQIWASLWD 

351 APLKAHQLAE AGAGIALDPG ELGVDTLRGA WRVLESREM AVAARRIiADE 

401 MLAAPTPAAL VPRLERLTAA HRRA 
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Figure 17 

1 MNLEYSGDIA RLYDLVHQGK GKDYRAEAEE LAALVTQRRP GARSLLDVAC 
51 GTGMHLRHLG DLFEEVAGVE MSPDMLAIAQ RRNPEAGIHR GDMRDFALGR 
101 RFDAVICMFS SIGHMRDQRE LDAAIGRFAA HLPSGGWIV DPWWFPETFT 
151 PGYVGASLVE AEGRTIARFS HSALEDGATR IDVDYLVGVP GEGVRHLKET 
201 HRITLFGRAQ YEAAFTAAGM SVEYLPHAAT DRGLFVGVQA * 
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